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PRODUCTION OF A TRANSGENIC AVIAN BY CYTOPLASMIC 

INJECTION 

The present application claims priority from U.S. Patent Application No. 
10/790,455, filed March 1, 2004; U.S. provisional patent application Serial No. 
60/490,452, filed July 28, 2003; U.S. provisional patent application Serial No. 
60/536,677, filed January 15, 2004; and U.S. provisional patent application Serial 
No. 60/458,014, filed March 27, 2003. 

Field of the Invention 

The present invention relates to the field of biotechnology, and more 
specifically to the field of avian genome modification. Disclosed herein are 
compositions, vectors, and methods of use thereof, for the generation of genetically 
transformed avian cells and transgenic birds. 

Background 

Transgenic technology to convert animals into "bioreactors" for the 
production of specific proteins or other substances of pharmaceutical interest 
(Gordon et al, 1987, Biotechnology 5: 1183-1187; Wilmut et al.. 1990, 
Theriogenology 33: 113-123) offers significant advantages over more conventional 
methods of protein production by gene expression. Recombinant nucleic acid 
molecules, for instance, have been engineered and incorporated into transgenic 
animals so that an expressed heterologous protein may be joined to a protein or 
peptide that allows secretion of the transgenic expression product into milk or urine, 
from which the protein may then be recovered. These procedures, however, may 
require lactating animals, with the attendant costs of maintaining individual animals 
or herds of large species, such as cows, sheep, or goats. 

Historically, transgenic animals have been produced almost exclusively by 
microinjection of the fertilized egg. The pronuclei of fertilized eggs are 
microinjected in vitro with foreign, i.e., xenogeneic or allogeneic, heterologous 
DNA or hybrid 

1 

SUBSTITUTE SHEET (RULE 26) 



wo 2004/092351 



PCT/US2004/009253 



DNA molecules. The microinjected fertilized eggs are then transferred to the genital 
tract of a pseudopregnant female (e.g., Krimpenfort et al, U.S. Pat. No. 5,175,384). 

One system that holds potential is the avian reproductive system. The 
production of an avian egg begins with formation of a large yolk in the ovary of the 
5 hen. The unfertilized oocyte or ovum is positioned on top of the yolk sac. After 
ovulation, the ovum passes into the infiindibulum of the oviduct where it is fertilized 
if sperm are present, and then moves into the magnum of the oviduct, which is lined 
with tubular gland cells. These cells secrete the egg-white proteins, including 
ovalbumin, lysozyme, ovomucoid, conalbumin and ovomucin, into the lumen of the 

10 magnum where they are deposited onto the avian embryo and. yolk. The hen oviduct 
offers outstanding potential as a protein bioreactor because of the high levels of 
protein production, the promise of proper folding and post-translation modification of 
the target protein, the ease of product recovery, and the shorter developmental period 
of chickens compared to other potential animal species. 

IS One method for creating permanent genomic modification of a eukaryotic cell 

is to integrate an introduced DNA into an existing chromosome. Only retroviruses 
have so far provided efficient integration. However, retroviral integration is directed 
to a number, albeit limited, of insertion sites within the recipient genome so that 
positional variation in heterologous gene expression can be evident. Unpredictability 

20 as to which insertion site is targeted introduces an undesirable lack of control over the 
procedure. An additional limitation of the use of retroviruses is that the size of the 
nucleic acid molecule encoding the virus and heterologous sequences is restricted to 
about 8 kb. Although wild-type adeno-associated virus (AAV) often integrates at a 
specific region in the human genome, vectors derived fi-om AAV do not integrate site- 

25 specifically due to the deletion of the toxic rep gene. Other well-known methods for 
genomic modification of animal cells include transfection of DNA using calcium 
phosphate co-precipitation, electroporation, lipofection, microinjection, protoplast 
fusion and particle bombardment, all of which methods typically produce random 
integration and at low frequency. Homologous recombination produces site-specific 

30 integration, but the fi^quency of such integration usually is very low. 
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An alternative method that has been considered for driving the integration of 
heterologous nucleic acid fragments into a chromosome is the use of a site-specific 
recombinase (integrase) that can catalyze the insertion or excision of nucleic acid 
fragments. These enzymes recognize relatively short unique nucleic acid sequences 
5 that serve for both recognition and recombination. Examples include Cre (Sternberg & 
Hamilton, 1981, J. Mol. Biol. 150: 467-486, 1981), Flp (Broach et al, 1982, Cell 29: 
227-234, 1982)andR(Matsuzakiefa/., \990, J. Bact. 172: 610-618, 1990). 

A novel class of phage integrases that includes the integrase from the phage 
phiC31 can mediate highly efficient integration of transgenes in mammalian cells both 
10 in vitro and in vivo (Thyagarajan et al., Mol. Cell Biol. 21: 3926-3934 (2001)). 
Constructs and methods of using recombinase to integrate heterologous DNA into a 
plant, insect or mammalian genome are described by Calos in U.S. Patent Serial No. 
6,632,672. 

The phiC31 integrase is a member of a subclass of integrases, termed serine 
15 recombinases, that include R4 and TP901-1. Unlike the phage lambda integrases, 
which belong to a tyrosine class of recombinases, the serine integrases do not require 
cofactors such as integration host factor. The phiC31 integrase normally mediates 
integration of the phiC31 bacteriophage into the genome of Streptomyces via 
recombination between the attP recognition sequence of the phage genome and the 
20 attB recognition sequence within the bacterial genome. When a plasmid is equipped 
with a single attB site, phiC31 integrase will detect and mediate crossover between the 
attB site and a pseudo-attP site within the mammalian genome. Such pseudo-attP 
integration sites have now been identified in the mouse and human genomes. If the 
heterologous DNA is in a circular or supercoiled form, the entire plasmid becomes 
25 integrated with attL and attR arms flanking the nucleic acid insert. PhiC31 integrase 
is not able to mediate the integration into genomic DNA of sequences bearing attP 
sites. 

PhiC31 integrase-mediated integration results in the destruction of the 
recognition or recombination sites themselves so that the integration reaction is 
30 irreversible. This will bypass the primary concern inherent with other recombinases, 
i.e., the reversibility of the integration reaction and excision of the inserted DNA. 
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It has been estimated that there are 50 to 100 pseudo-attP sites in mammalian 
genomes (mouse and human) and some sites are apparently preferred for integration 
over others. The chicken genome, however, is only about one-third the size of 
mammalian genomes, and it was unknown whether there would be a sufficient 
5 number of pseudo attP sites in the chicken genome to allow efficient integrase- 
mediated integration. 

We have found that the phiC31 integrase is active in avian cells, increasing the 
rate of integration over that of a non-integrase-mediated integration. Furthermore, we 
have determined that the phiC31 integrase works well at both 37° Celsius and 41° 
10 Celsius, showing that it will function in the environment of a developing avian 
embryo. 

A need still exists, however, for methods by which avian chromosomes can be 
permanently modified in an efficient and site-specific manner and the genetically 
transformed cells used to generate transgenic birds. 

15 

Summary of the Invention 

Integration of a transgene into a defined chromosomal site is useful to improve 
the predictability of expression of the transgene, which is particularly advantageous 
when creating transgenic avians. Transgenesis by methods that randomly insert a 
20 transgene into an avian genome is often inefiRcient since the transgene may not be 
expressed at the desired levels or in desired tissues. 

A novel class of phage integrases, and in particular the integrase from phage 
phiC31, can mediate the efficient integration of transgenes into target cells both in 
vitro and in vivo. When a plasmid is equipped with a single attB site, phiCSl 
25 integrase detects attP homologous sequences, termed pseudo-attP sites, in a target 
genome and mediates crossover between the attB site and a pseudo attP site. 

The present invention provides novel methods and recombinant polynucleotide 
molecules for transfecting and integrating a heterologous nucleic acid molecule into 
the genome of an avian cell. The methods of the invention deliver to an avian cell 
30 population a first nucleic acid molecule that comprises a region encoding a bacterial 
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recombination site. A source of integrase activity also delivered to the avian cell can 
be an integrase-encoding nucleic acid sequence and its associated promoter included 
in the first nucleic acid molecule or as a region of a second nucleic acid molecule that 
may be co-delivered w^ith the polynucleotide molecule. Alternatively, integrase 
5 protein itself can be delivered directly to the target cell. 

The recombinant nucleic acid molecules of the present invention may further 
comprise a heterologous nucleotide sequence operably linked to a promoter so that the 
heterologous nucleotide sequence, when integrated into the genome DNA of a 
recipient avian cell, can be expressed to yield a desired polypeptide. The nucleic acid 

10 molecule may also include a second transcription initiation site, such as an internal 
ribosome entry site (IRES), operably linked to a second heterologous polypeptide- 
encoding region desired to be expressed with the first polypeptide in the same cell. 

The heterologous nucleic acid molecule of the present invention may include a 
cassette for the expression in a recipient avian cell of a desired heterologous 

15 polypeptide. Optionally, the nucleic acid molecules may further comprise a marker 
such as, but not limited to, a puromycin resistance gene, a luciferase gene, EGFP- 
encoding gene, and the like. 

Once delivered to a recipient avian cell, the phiCSl integrase mediates 
recombination between the att site within the nucleic acid molecule and a 

20 bacteriophage attachment site within the genomic DNA of the avian cell. Both att 
sites are disrupted and the nucleic acid molecule, with partial att sequences at each 
end, is stably integrated into the genome attP site. The phiC31 integrase, by 
disrupting the att sites of the incoming nucleic acid and of the recipient site within the 
avian cell genome, precludes any subsequent reverse recombination event that would 

25 excise the integrated nucleic acid and reduce the overall efficiency of stable 
incorporation of the heterologous nucleic acid. 

Following delivery of the nucleic acid molecule and a source of integrase 
activity into an avian cell population and integrase-mediated recombination, the cells 
may be returned to an embryo. Late stage blastodermal cells may be returned to a 

30 hard shell egg, which is resealed for incubation until hatching. Stage I embryos may 
be directly microinjected with the polynucleotide and source of integrase activity. 
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isolated, transfected and returned to a stage I embryo which is reimplanted into a hen 
for further development. Alternatively, the transfected cells may be maintained in 
vitro culture. 

The present invention further provides modified isolated avian or artificial 
5 chromosomes useful as vectors to shuttle transgenes or gene clusters into the avian 
genome. By delivery to the modified chromosome to an isolated recipient cell, the 
target cell, and progeny thereof, become trisomic. The additional or trisomic 
chromosome will not affect the subsequent development of the recipient cell and/or an 
embryo, nor interfere with the reproductive capacity of an adult bird developed from 

10 such cells or embryos. The chromosome will also be stable within chicken cells. The 
invention provides methods to isolate a population of chromosomes for delivery into 
chicken embryos or early cells. 

The method comprises inserting a lac-operator sequence into an isolated 
chromosome and, optionally, inserting a desired transgene sequence within the same 

15 chromosome. The lac operator region is typically a concatamer of a plurality of lac 
operators for the binding of multiple lac repressor molecules. A recombinant DNA 
molecule is constructed that includes an identified region of the target chromosome, a 
recombination site such as attB or attP, and the lac-operator concatamer. The 
recombinant molecule is delivered to an avian cell, and homologous recombination 

20 will integrate the heterologous polynucleotide and the lac-operator concatamer into 
the targeted chromosome. A tag-polypeptide, such as the GPF-lac-repressor fusion 
protein, binds to the lac-operator sequence for identification and isolation of the 
genetically modified chromosome. The tagged mitotic chromosome can be isolated 
using, for instance, flow cytometry. 

25 Another aspect of the present invention is an avian cell genetically modified 

with a transgene vector by the methods of the invention. For example, in one 
embodiment, the transformed cell can be a chicken early stage blastodermal cell or a 
genetically transformed cell line, including a sustainable cell line. The transfected cell 
may comprise a transgene stably integrated into the nuclear genome of the recipient 

30 cell, thereby replicating with the cell so that each progeny cell receives a copy of the 
transfected nucleic acid. A particularly useful cell line for the delivery and integration 
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of a transgene comprises a heterologous attP site that can increase the efficiency of 
integration of a polynucleotide by phiC31 integrase and, optionally, a region for 
expressing the integrase. 

Another aspect of the present invention is methods of expressing a 
5 heterologous polypeptide in an avian cell by stably transfecting a cell by using site- 
specific integrase-mediation and a recombinant nucleic acid molecule, as described 
above, and culturing the transfected cell under conditions suitable for expression of 
the heterologous polypeptide under the control of the avian transcriptional regulatory 
region. 

10 Yet another aspect of the present invention concerns transgenic birds, such as 

chickens, comprising a recombinant nucleic acid molecule and which preferably 
(though optionally) express a heterologous gene in one or more cells in the animal. 
Embodiments of the methods for the production of a heterologous polypeptide by the 
avian tissue involve providing a suitable vector and introducing the vector into 

15 embryonic blastodermal cells together with an integrase, preferably phiC31 integrase, 
so that the vector can integrate into the avian genome. A subsequent step involves 
deriving a mature transgenic avian from the transgenic blastodermal cells by 
transferring the transgenic blastodermal cells to an embryo and allowing that embryo 
to develop fully, so that the cells become incorporated into the bird as the embryo is 

20 allowed to develop. An alternative is to transfer a transfected nucleus to an enucleated 
recipient cell which may then develop into a zygote and ultimately an adult bird. The 

1 

resulting chick is then grown to maturity. 

In various embodiments of the transgenic bird of the present invention, the 
expression of the transgene may be restricted to specific subsets of cells, tissues or 

25 developmental stages utilizing, for example, trans-acting factors acting on the 
transcriptional regulatory region operably linked to the polypeptide-encoding region of 
interest of the present invention and which control gene expression in the desired 
pattern. Tissue-specific regulatory sequences and conditional regulatory sequences 
can be used to control expression of the transgene in certain spatial patterns. 

30 Moreover, temporal patterns of expression can be provided by, for example, 
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conditional recombination systems or prokaiyotic transcriptional regulatory 
sequences. 

The invention can be used to express, in large yields and at low cost, a wide 
range of desired proteins including those used as human and animal pharmaceuticals, 
5 diagnostics, and livestock feed additives. Proteins such as growth hormones, 
cytokines, structural proteins and enzymes including human growth hormone, 
interferon, lysozyme, and P-casein are examples of proteins which are desirably 
expressed in the oviduct and deposited in eggs according to the invention. 

The present invention includes methods of producing transgenic avians, for 
10 example, transgenic chickens, which employ the use of integrase, cationic polymers 
and/ nuclear localization signals. The present invention also includes the transgenic 
avians produced by these methods and other methods disclosed herein. The invention 
also includes the eggs produced by the transgenic avians produced by these methods 
and other methods disclosed herein. 
15 In one embodiment, the methods of the invention include introducing into an 

avian cell: 1) a nucleic acid comprising a transgene; 2) an integrase activity; and 3) a 
cationic polymer. Such methods provide for an increased efficiency of transgenic 
avian production relative to identical methods without the cationic polymer. 

In another embodiment, the methods include introducing into an avian cell: 1) 
>0 a nucleic acid comprising a transgene; 2) an integrase activity; and 3) and a nuclear 
localization signal. Such methods provide for an increased efficiency of transgenic 
avian production relative to identical methods without the nuclear localization signal. 

In another embodiment, the methods include introducing into an avian cell: 1) 
a nucleic acid comprising a transgene; 2) an integrase activity; 3) a cationic polymer; 
:5 and 4) a nuclear localization signal. Such methods provide for an increased efficiency 
of transgenic avian production relative to identical methods without the cationic 
polymer or without the nuclear localization signal. 

In one embodiment, the avian cell is a cell of an avian embryo. For example, 
the avian cell may be a cell of an early stage embryo comprising a germinal disc. The 
0 avian cell may be, for example, a cell of a stage I avian embryo, a cell of a stage H 
avian embiyo, a cell of a stage HI avian embryo, a cell of a stage IV avian embryo, a 
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cell of a stage V avian embryo, a cell of a stage VI avian embryo, a cell of a stage VII 
avian embryo, a cell of a stage VIII avian embryo, a cell of a stage IX avian embryo, a 
cell of a stage X avian embryo, a cell of a stage XI avian embryo or a cell of a stage 
XII avian embryo. In one particularly useful embodiment, the avian cell is a cell of a 
5 stage X avian embryo. 

The methods provide for the introduction of nucleic acid into the avian cell by 
any suitable technique known to those of skill in the art. For example, the nucleic 
acid may be introduced into the avian cell by microinjecting, transfection, 
electroporation or lipofection. In one particularly useful embodiment, the introduction 

1 0 of the nucleic acid is done by microinjecting. 

The nucleic acid which includes a transgene may be DNA or RNA or a 
combination of RNA and DNA. The nucleic acid may comprise a single strand or 
may comprise a double strand. The nucleic acid may be a linear nucleic acid or may 
be an open or closed circular nucleic acid and may be naturally occurring or synthetic. 

1 5 Integrase activity may be introduced into the avian cell in any suitable form. In 

one embodiment, an integrase protein is introduced into the avian cell. In another 
embodiment, a nucleic acid encoding an integrase is introduced into the avian cell. 
The nucleic acid encoding the integrase may be double stranded DNA, single stranded 
DNA, double stranded RNA, single stranded RNA or a single or double stranded 

20 nucleic acid which includes both RNA and DNA. In one particularly useful 
embodiment, the nucleic acid is mRNA. Integrase activity may be introduced into the 
avian cell by any suitable technique. Suitable techniques included those described 
herein for introducing the nucleic acid encoding a transgene into an avian cell. In one 
useful embodiment, the integrase activity is introduced into the avian cell with the 

25 nucleic acid encoding the transgene. For example, the integrase activity may be 
introduced into the avian cell in a mixture with the nucleic acid encoding the 
transgene. 

In one embodiment, a nuclear localization signal (NLS) is associated with the 
nucleic acid which includes a transgene. For example, the NLS may be associated 
30 with the nucleic acid by a chemical bond. Examples of chemical bonds by which an 
NLS may be associated with the nucleic acid include an ionic bond, a covalent bond. 
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hydrogen bond and Van der Waal's force. In one particularly useful embodiment, the 
nucleic acid which includes a transgene is associated with an NLS by an ionic bond. 
NLS may be introduced into the avian cell by any suitable technique. Suitable 
techniques included those described herein for introducing the nucleic acid encoding a 
5 transgene into an avian cell. In one useful embodiment, the NLS is introduced into 
the avian cell with the nucleic acid encoding the transgene. For example, the NLS 
may be introduced into the avian cell while associated with the nucleic acid encoding 
the transgene. 

Cationic polymers may be employed to facilitate the production of transgenic 

10 avians. For example, the cationic polymers may be employed in combination with 
integrase and/or NLS. Any suitable cationic polymer may be used. For example, and 
without limitation, one or more of polyethylenimine, polylysine, DEAE-dextran, 
starburst dendriraers and starburst polyamidoamine dendrimers may be used. In a 
particularly useful embodiment, the cationic polymer includes polyethylenimine. The 

15 cationic polymer may be introduced into the avian cell by any suitable technique. 
Suitable techniques included those described herein for introducing the nucleic acid 
encoding a transgene into an avian cell. In one useful embodiment, the cationic 
polymer is introduced into the avian cell in a mixture with the nucleic acid encoding 
the transgene. For example, the cationic polymer may be introduced into the avian 

20 cell while associated with the nucleic acid encoding the transgene. 

In one particularly useful embodiment of the invention, the transgene includes 
a coding sequence which is expressed in a cell of the transgenic avian producing a 
peptide or a polypeptide (e.g., a protein). The coding sequence may be expressed in 
any or all of the cells of the transgenic avian. For example, the coding sequence may 

25 be expressed in the blood, the magnum and/or the sperm of the transgenic avian. In a 
particularly useful embodiment of the invention, the polypeptide is present in an egg, 
for example, in the egg white, produce by the transgenic avian. 

The methods of the invention include introducing the avian cell into a recipient 
avian, for example, a hen, wherein the recipient avian produces an offspring which 

30 includes the transgene. The avian cell may be introduced into a recipient avian by any 
suitable technique. 
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The present invention also includes methods of dispersing nucleic acid in a 
cell, for example an avian cell (e.g., an avian embryo cell). These methods include 
introducing into a cell a nucleic acid and a dispersing agent, for example, a cationic 
polymer (e.g., polyethylenimine, polylysine, DEAE-dextran, starburst dendrimers 
5 and/or starburst polyamidoamine dendrimers) in an amount that will disperse the 
nucleic acid in a cell. In one embodiment, the methods include introducing an avian 
cell into a recipient avian wherein the recipient avian produces an offspring which 
includes the transgene. 

In one embodiment, the nucleic acid includes a transgene. NLS or integrase 
1 0 activity may also be introduced into the cell. 

Typically, the dispersing of the nucleic acid is a homogeneous dispersing. 

Any combination of features described herein are included within the scope of 
the present invention provided that the features included in any such combination are 
not mutually inconsistent as will be apparent from the context, this specification, and 
1 5 the knowledge of one of ordinary skill in the art. 

Additional objects and aspects of the present invention will become more 
apparent upon review of the detailed description set forth below when taken in 
conjunction with the accompanying figures, which are briefly described as follows. 

20 Brief Description of the Figures 

Fig. 1 illustrates phage integrase-mediated integration. A plasmid vector 

bearing the transgene includes the attB recognition sequence for the phage integrase. 

The vector along with integrase-coding mRNA, a vector expressing the integrase, or 

the integrase protein itself, are delivered into cells or embryos. The integrase 
25 recognizes DNA sequences in the avian genome similar to attP sites, termed pseudo- 

attP, and mediates recombination between the attB and pseudo-attP sites, resulting in 

the permanent integration of the transgene into the avian genome. 

Fig. 2 illustrates the persistent expression of luciferase from a nucleic acid 

molecule after phiC3 1 integrase-mediated integration into chicken cells. 
30 Fig. 3 illustrates the results of a puromycin resistance assay to measure phiCSl 

integrase-mediated integration into chicken cells. 
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Fig. 4 illustrates phiC31 integrase-mediated integration into quail cells. 
Puromycin resistance vectors bearing attB sites were cotransfected with phiC31 
integrase, or a control vector, into QT6 cells, a quail fibrosarcoma cell line. One day 
after transfection, puromycin was added. Puromycin resistant colonies were counted 
5 1 2 days post-transfection. 

Figs. 5 A and 5B illustrate that phiC31 integrase can facilitate multiple 
integrations per avian cell. A puromycin resistance vector bearing an attB site was 
cotransfected with an enhanced green fluorescent protein (EGFP) expression vector 
bearing an attB site, and a phiCSl integrase expression vector. After puromycin 
10 selection, many puromycin resistant colonies expressed EGFP in all of their cells. 
Figs. 5A and 5B are the same field of view with EGFP illuminated with ultraviolet 
light (Fig. 5A) and puromycin resistant colonies photographed in visible light (Fig. 
5B). In Fig. 5B, there are 4 puromycin resistant colonies, two of which are juxtaposed 
at the top. One of these colonies expressed EGFP. 
15 Fig. 6 shows maps of the small vectors used for integrase assays. 

Fig. 7 shows integrase promotes efficient integration of large transgenes in 
avian cells. 

Fig. 8 shows maps of large vectors used for integrase assays. 
Fig. 9 illustrates the nucleotide sequence of the integrase-expressing plasmid 
20 pCMV-31int(SEQiDNO: 1). 

Fig. 10 illustrates the nucleotide sequence of the plasmid pCMV-luc-attB 
(SEQ ID NO: 2). 

Fig. 11 illustrates the nucleotide sequence of the plasmid pCMV-luc-attP 
(SEQ ID NO: 3). 

25 Fig. 12 illustrates the nucleotide sequence of the plasmid pCMV-pur-attB 

(SEQ ID NO: 4). 

Fig. 13 illustrates the nucleotide sequence of the plasmid pCMV-pur-attP 
(SEQ ID NO: 5). 

Fig. 14 illustrates the nucleotide sequence of the plasmid pCMV-EGFP-attB 
30 (SEQ ID NO: 6). 
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Fig. 15 illustrates the nucleotide sequence of the plasmid pl2.0-lys- 
LSPIPNMM-CMV-pur-attB (SEQ ID NO: 7). 

Fig. 16 illustrates the nucleotide sequence of the plasmid pOMIFN-Ins-CMV- 
pur-attB (SEQ ID NO: 8). 
5 Fig. 17 illustrates the nucleotide sequence of the integrase-expressing plasmid 

pRSV-Int (SEQ ID NO: 9). 

Fig. 18 illustrates the nucleotide sequence of the plasmid pCR-XL-TOPO- 
CMV-pur-attB (SEQ ID NO: 10). 

Fig. 19 illustrates the nucleotide sequence of the attP containing 
1 0 polynucleotide SEQ ID NO: 11 . 

Fig. 20 illustrates in schematic form the integration of a heterologous att 
recombination site into an isolated chromosome. The attB sequence is linked to 
selectable maker such as a puromycin expression cassette and is flanked by sequences 
found in the target site of the chromosome to be modified. The DNA is transfected 
15 into cells containing the chromosome and stable transfectants are selected by drug 
resistance. Site specific integration may be confirmed by several techniques including 
PGR. 

Fig. 21 illustrates the persistent expression of luciferase from a nucleic acid 
molecule after phiC31 integrase-mediated integration into chicken cells bearing a 
20 wild-type attP sequence. 

Fig. 22 illustrates the distribution of plasmid DNA in a stage I embryo. 
Fig. 23 illustrates the distribution of plasmid DNA in a stage I embryo in the 
presence of low molecular weight polyethylenimine. 

Fig. 24 illustrates the distribution of plasmid DNA in a stage I embryo in the 
25 presence of low molecular weight polyethylenimine. 

Detailed Description of the Preferred Embodiments 

This description uses gene nomenclature accepted by the Cucurbit Genetics 
Cooperative as it appears in the Cucurbit Genetics Cooperative Report 18:85 (1995), 
30 which are incorporated herein by reference in its entirety. Using this gene 
nomenclature, genes are symbolized by italicized Roman letters. If a mutant gene is 
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recessive to the normal type, then the symbol and name of the mutant gene appear in 
italicized lower case letters. 

The disclosures of publications, patents, and published patent specifications 
referenced in this application are hereby incorporated by reference into the present 
5 disclosure to more fiilly describe the state of the art to which this invention pertains. 

Definitions 

For convenience, definitions of certain terms employed in the specification, 
examples, and appended claims are collected here. 

As used in this specification and the appended claims, the singular forms "a," 
10 "an" and "the" include plural references unless the content clearly dictates otherwise. 
Thus, for example, reference to "an antigen" includes a mixture of two or more such 
agents. 

The term "avian" as used herein refers to any species, subspecies or race of 
organism of the taxonomic class ava, such as, but not limited to chicken, turkey, duck, 

15 goose, quail, pheasants, parrots, finches, hawks, crows and ratites including ostrich, 
emu and cassowary. The term includes the various knovm strains of Gallus gallus, or 
chickens, (for example, White Leghorn, Brown Leghorn, Barred-Rock, Sussex, New 
Hampshire, Rhode Island, Australorp, Minorca, Amrox, California Gray), as well as 
strains of turkeys, pheasants, quails, duck, ostriches and other poultry commonly bred 

20 in commeFcial quantities. It also includes an individual avian organism in all stages of 
development, including embryonic and fetal stages. The term "avian" also may 
denote "pertaining to a bird", such as "an avian (bird) cell." 

The term "nucleic acid" as used herein includes any natural or synthetic linear 
and sequential array of nucleotides and nucleosides, for example cDNA, genomic 

25 DNA, mRNA, tRNA, oligonucleotides, oligonucleosides and derivatives thereof For 
ease of discussion, such nucleic acids may be collectively referred to herein as 
"constructs," "plasm ids," or "vectors." The term "nucleic acid" further includes 
modified or derivatized nucleotides and nucleosides such as, but not limited to, 
halogenated nucleotides such as, but not only, 5-bromouracil, and derivatized 

30 nucleotides such as biotin-labeled nucleotides. 
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The terms "polynucleotide," "oligonucleotide," and "nucleic acid sequence" 
are used interchangeably herein and include, but are not limited to, coding sequences 
(polynucleotide(s) or nucleic acid sequence(s) which are transcribed and translated 
into polypeptide in vitro or in vivo when placed under the control of appropriate 
5 regulatory or control sequences); control sequences (e.g., translational start and stop 
codons, promoter sequences, ribosome binding sites, polyadenylation signals, 
transcription factor binding sites, transcription termination sequences, upstream and 
downstream regulatory domains, enhancers, silencers, and the like); and regulatory 
sequences (DNA sequences to which a transcription factor(s) binds and alters the 
10 activity of a gene's promoter either positively (induction) or negatively (repression)). 
No limitation as to length or to synthetic origin are suggested by the terms described 
above. 

As used herein the terms "peptide," "polypeptide" and "protein" refer to a 
polymer of amino acids in a serial array, linked through peptide bonds. A "peptide" 

15 typically is a polymer of at least two to about 30 amino acids linked in a serial array 
by peptide bonds. The term "polypeptide" includes proteins, protein fragments, 
protein analogues, oligopeptides and the like. The term "polypeptides" contemplates 
polypeptides as defined above that are encoded by nucleic acids, produced through 
recombinant technology (isolated from an appropriate source such as a bird), or 

20 synthesized. The term "polypeptides" further contemplates polypeptides as defined 
above that include chemically modified amino acids or amino acids covalently or 
noncovalently linked to labeling moieties. 

The terms "percent sequence identity" or "percent sequence similarity" as used 
herein refer to the degree of sequence identity between two nucleic acid sequences or 

25 two amino acid sequences as determined using the algorithm of Karlin & Attschul, 
Proc. Natl. Acad. Sci. 87: 2264-2268 (1990), modified as in Karlin & Attschul, Proc. 
Natl. Acad. Sci. 90: 5873-5877 (1993). Such an algorithm is incorporated into the 
NBLAST and XBLAST programs of Attschul et al, 1990, T. Mol. Biol. Q15: 403- 
410. BLAST nucleotide searches are performed with the NBLAST program, score = 

30 100, word length = 12, to obtain nucleotide sequences homologous to a nucleic acid 
molecule of the invention. BLAST protein searches are performed with the XBLAST 
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program, score == 50, word length = 3, to obtain amino acid sequences homologous to 
a reference polypeptide. To obtain gapped alignments for comparison purposes. 
Gapped BLAST is utilized as described in Attschul et al., Nucl. Acids Res. 25: 3389- 
3402 (1997). When utilizing BLAST and Gapped BLAST programs, the default 
5 parameters of the respective programs (e.g. XBLAST and NBLAST) are used. Other 
algorithms, programs and default settings may also be suitable such as, but not only, 
the GCG-Sequence Analysis Package of the U.K. Human Genome Mapping Project 
Resource Centre that includes programs for nucleotide or amino acid sequence 
comparisons. Examples of preferred algorithms are FASTA and BESTFIT. 

1 0 The terms "recombinant nucleic acid" and "recombinant DNA" as used herein 

refer to combinations of at least two nucleic acid sequences that are not naturally 
found in a eukaryotic or prokaiyotic cell. The nucleic acid sequences may include, but 
are not limited to, nucleic acid vectors, gene expression regulatory elements, origins 
of replication, suitable gene sequences that when expressed confer antibiotic 

15 resistance, protein-encoding sequences and the like. The term "recombinant 
polypeptide" is meant to include a polypeptide produced by recombinant DNA 
techniques. A recombinant polypeptide may be distinct fi-om a naturally occurring 
polypeptide either in its location, purity or structure. Generally, a recombinant 
polypeptide will be present in a cell in an amount different from that normally 

20 observed in nature. 

The term "gene" or "genes" as used herein refers to nucleic acid sequences that 
encode genetic information for the synthesis of a whole RNA, a whole protein, or any 
portion of such whole RNA or whole protein. Genes that are not naturally part of a 
particular organism's genome are referred to as "foreign genes," "heterologous genes" 

25 or "exogenous genes" and genes that are naturally a part of a particular organism's 
genome are referred to as "endogenous genes". The term "gene product" refers to an 
RNA or protein that is encoded by the gene. "Endogenous gene products" are RNAs 
or proteins encoded by endogenous genes. "Heterologous gene products" are RNAs 
or proteins encoded by "foreign, heterologous or exogenous genes" and are, therefore, 

30 not naturally expressed in the cell. 

The term "expressed" or "expression" as used herein refers to the transcription 
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from a gene to give an RNA nucleic acid molecule at least complementary in part to a 
region of one of the two nucleic acid strands of the gene. The term "expressed" or 
"expression" as used herein may also refer to the translation from an RNA molecule 
to give a protein, a polypeptide or a portion thereof. 
5 The term "operably linked" refers to an arrangement of elements wherein the 

components so described are configured so as to perform their usual function. Control 
sequences operably linked to a coding sequence are capable of effecting the 
expression of the coding sequence. The control sequences need not be contiguous with 
the coding sequence, so long as they function to direct the expression thereof. For 

10 example, intervening untranslated yet transcribed sequences can be present between a 
promoter sequence and the coding sequence and the promoter sequence can still be 
considered "operably linked" to the coding sequence. 

The term "transcription regulatory sequences" as used herein refers to 
nucleotide sequences that are associated with a gene nucleic acid sequence and which 

1 5 regulate the transcriptional expression of the gene. Exemplary transcription regulatory 
sequences include enhancer elements, hormone response elements, steroid response 
elements, negative regulatory elements, and the like. 

The term "promoter" as used herein refers to the DNA sequence that 
determines the site of transcription initiation by an RNA polymerase. A "promoter- 

20 proximal element" is a regulatory sequence generally within about 200 base pairs of 
the transcription start site. 

The term "internal ribosome entry sites (IRES)" as used herein refers to a 
region of a nucleic acid, most typically an RNA molecule, wherein eukaryotic 
initiation of protein synthesis occurs far downstream of the 5' end of the RNA 

25 molecule. A 43S pre-initiation complex comprising the elf2 protein bound to GTP 
and Met-tRNAi"^**, the 40S ribosomal subunit, and factors elfJ and 31fl A may bind to 
an "IRES" before locating an AUG start codon. An "IRES" may be used to initiate 
translation of a second coding region downstream of a first coding region, wherein 
each coding region is expressed individually, but under the initial control of a single 

30 upstream promoter. An "IRES" may be located in a eukaryotic cellular mRNA. 

The term "coding region" as used herein refers to a continuous linear 
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arrangement of nucleotides which may be translated into a polypeptide. A full length 
coding region is translated into a full length protein; that is, a complete protein as 
would be translated in its natural state absent any post-translational modifications. A 
full length coding region may also include any leader protein sequence or any other 
5 region of the protein that may be excised naturally fh)m the translated protein. 

The terms "vector" or "nucleic acid vector" as used herein refer to a natural or 
synthetic single or double stranded plasmid or viral nucleic acid molecule (RNA or 
DNA) that can be transfected or transformed into cells and replicate independently of, 
or within, the host cell genome. The term "expression vector" as used herein refers to 
10 a nucleic acid vector that comprises a transcription regulatory region operably linked 
to a site wherein is, or can be, inserted, a nucleotide sequence to be transcribed and, 
optionally, to be expressed, for instance, but not limited to, a sequence coding at least 
one polypeptide. 

The term "transfection" as used herein refers to the process of inserting a 

15 nucleic acid into a host cell. Many techniques are well known to those skilled in the 
art to facilitate transfection of a nucleic acid into an eukaryotic cell. These methods 
include, for instance, treating the cells with high concentrations of salt such as a 
calcium or magnesium salt, an electric field, detergent, or liposome mediated 
transfection, to render the host cell competent for the uptake of the nucleic acid 

20 molecules, and by such methods as micro-injection into a pro-nucleus, sperm- 
mediated and restriction-mediated integration. 

The terms "recombinant cell" and "genetically transformed cell" refer to a cell 
comprising a combination of nucleic acid segments not found in a single cell with 
each other in nature. A new combination of nucleic acid segments can be introduced 

25 into an organism using a wide array of nucleic acid manipulation techniques available 
to those skilled in the art. The recombinant cell may harbor a vector that is 
extragenomic, i.e. that does not covalently insert into the cellular genome, including a 
non-nuclear (e.g. mitochondrial) genome(s). A recombinant cell may further harbor a 
vector or a portion thereof that is intragenomic, i.e. covalently incorporated within the 

30 genome of the recombinant cell. 
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As used herein, a "transgenic avian" is any avian, as defined above, including 
tiie chicken and quail, in which one or more of the cells of the avian contain 
heterologous nucleic acid introduced by manipulation, such as by transgenic 
techniques. The nucleic acid may be introduced into a cell, directly or indirectly, by 
5 introduction into a precursor of the cell by way of deliberate genetic manipulation, 
such as by microinjection or by infection with a recombinant virus. Genetic 
manipulation also includes classical cross-breeding, or in vitro fertilization. A 
recombinant DNA molecule may be integrated within a chromosome, or it may be 
extrachromosomally replicating DNA. 
10 The terms "chimeric animal" or "mosaic animal" are used herein to refer to 

animals in which the recombinant gene is found, or in which the recombinant is 
expressed, in some but not all cells of the animal. The term "tissue-specific chimeric 
animal" indicates that the recombinant gene is present and/or expressed in some 
tissues but not others. 

15 As used herein, the term "transgene" means a nucleic acid sequence that is 

partly or entirely heterologous, i.e., foreign, to the transgenic animal or cell into which 
it is introduced, or, is homologous to an endogenous gene of the transgenic animal or 
cell into which it is introduced, but which is designed to be inserted, or is inserted, 
into the animal's genome in such a way as to alter the genome of the cell into which it 

20 is inserted {e.g., it is inserted at a location which differs from that of the natural gene 
or its insertion results in a knockout). 

The term "cytokine" as used herein refers to any secreted polypeptide that 
affects a function of cells and modulates an interaction between cells in the immune, 
inflammatory or hematopoietic response. A cytokine includes, but is not limited to, 

25 monokines and lymphokines. Examples of cytokines include, but are not limited to, 
interferon (x2b, Interleukin-1 (IL-1), Interleukin-6 (IL-6), Interleukin-8 aL-8), Tumor 
Necrosis Factor-fl (TNF-D.) and Tumor Necrosis Factor □ (TNF-D.). 

The term "antibody" as used herein refers to polyclonal and monoclonal 
antibodies and fragments thereof, and immunologic binding equivalents thereof. 

30 Antibodies may include, but are not limited to polyclonal antibodies, monoclonal 
antibodies (mAbs), humanized or chimeric antibodies, single chain antibodies. Fab 
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fragments, F(ab')2 fragments, fragments produced by a Fab expression library, anti- 
idiotypic (anti-Id) antibodies, and epitope-binding fragments of any of the above. 

The term "immunoglobulin polypeptide" as used herein refers to a constituent 
polypeptide of an antibody or a polypeptide derived therefrom. An "immunological 
5 polypeptide" may be, but is not limited to, an immunological heavy or light chain and 
may include a variable region, a diversity region, joining region and a constant region 
or any combination, variant or truncated fomi thereof. The term "immunological 
polypeptides" fiuther includes single-chain antibodies comprised of, but not limited 
to, an immimoglobulin heavy chain variable region, an immunoglobulin light chain 
1 0 variable region and optionally a peptide linker. 

The terms "integrase" and "integrase activity" as used herein refer to a nucleic 
acid recombinase of the serine recombinase family of proteins. 

The term "source of integrase activity" as used herein refers to a polypeptide 
or multimeric protein having serine recombinase (integrase) activity in an avian cell. 
1 5 The term may further refer to a polynucleotide encoding the serine recombinase, such 
as an mRNA, an expression vector, a gene or isolated gene that may be expressed as 
the recombinase-specific polypeptide or protein. 

The term "recombination site" as used herein refers to a polynucleotide stretch 
comprising a recombination site normally recognized and used by an integrase. For 
20 example, X phage is a temperate bacteriophage that infects E. coli. The phage has one 
attachment site for recombination (attP) and the E. coli bacterial genome has an 
attachment site for recombination (attB). Both of these sites are recombination sites 
for X, integrase. Recombination sites recognized by a particular integrase can be 
derived from a homologous system and associated with heterologous sequences, for 
25 example, the attP site can be placed in other systems to act as a substrate for the 
integrase. 

The term "pseudo-recombination site" as used herein refers to a site at which 
an integrase can facilitate recombination even though the site may not have a sequence 
identical to the sequence of its wild-type recombination site. For example, a phiC31 
30 integrase and vector carrying a phiC31 wild-type recombination site can be placed 
into an avian cell. The wild-type recombination sequence aligns itself with a sequence 
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in the avian cell genome and the integrase facilitates a recombination event. When the 
sequence from the genomic site in the avian cell, where the integration of the vector 
took place, is examined, the sequence at the genomic site typically has some identity 
to, but may not be identical with, the wild-type bacterial genome recombination site. 
5 The recombination site in the avian cell genome is considered to be a pseudo- 
recombination site (e.g., a pseudo-attP site) at least because the avian cell is 
heterologous to the normal phiC31 phage/bacterial cell system. The size of the 
pseudo-recombination site can be determined through the use of a variety of methods 
including, but not limited to, (i) sequence alignment comparisons, (ii) secondary 
10 structural comparisons, (iii) deletion or point mutation analysis to find the functional 
limits of the pseudo-recombination site, and (iv) combinations of the foregoing. 

A nucleic acid firagment of interest may be a trait-producing sequence, by 
which it is meant a sequence conferring a non-native trait upon the cell in which the 
protein encoded by the trait-producing sequence is expressed. The term "non-native" 

1 5 when used in the context of a trait-producing sequence means that the trait produced is 
different than one would find in an unmodified organism which can mean that the 
organism produces high amounts of a natural substance in comparison to an 
unmodified organism, or produces a non-natural substance. For example, the genome 
of a bird could be modified to produce proteins not normally produced in birds such 

20 as, for instance, human or mouse antibodies, human cytokines, etc. Other useful traits 
include disease resistance, meat flavor, animal size, and the like. 

A nucleic acid fragment of interest may additionally be a "marker nucleic acid" 
or expressed as a "marker polypeptide". Marker genes encode proteins that can be 
easily detected in transformed cells and are, therefore, useful in the study of those 

25 cells. Examples of suitable marker genes include P-galactosidase, green or yellow 
fluorescent proteins, enhanced green fluorescent protein, chloramphenicol acetyl 
transferase, luciferase, and the like. Such regions may also include those 5' noncoding 
sequences involved with initiation of transcription and translation, such as the 
enhancer, TATA box, capping sequence, CAAT sequence, and the like 

30 The term "transformed" as used herein refers to a heritable alteration in a cell 

resulting from the uptake of a heterologous DNA. 
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The term "trisomic" as used herein refers to a cell or animal, such as an avian 
cell or bird that has a 2n+l chromosomal complement, where n is the haploid number 
of chromosomes, for the animal species concerned. 

Techniques useful for isolating and characterizing the nucleic acids and 
5 proteins of the present invention are well known to those of skill in the art and 
standard molecular biology and biochemical manuals may be consulted to select 
suitable protocols without undue experimentation. See, for example, Sambrook er a/ . 
1989, "Molecular Cloning: A Laboratory Manual", 2nd ed.. Cold Spring Harbor, the 
content of which is herein incorporated by reference in its entirety. 

10 

Abbreviations 

Abbreviations used in the present specification include the following: aa, 
amino acid(s); bp, base pair(s); kb, kilobase; att, bacterial recombination attachment 
site; lU, infectious units. 

15 In the standard method of integrase mediated-transgenesis, a serine 

recombinase integrase mediates recombination between an attB site on a transgene 
vector and a pseudo atfP site on a chromosome. In the method of the invention for 
integrase-mediated transgenesis, a heterologous wild-type attP site can be integrated 
into an avian nuclear genome to create a transgenic cell line or bird. A serine 

20 recombinase (integrase) and an attB-bearing transgene vector are then introduced into 
cells harboring the heterologous attP site, or into embryos derived from birds which 
bear the attP recombination site. The locations of attP and attB may be reversed such 
that the attB site is inserted into an avian chromosome and the attP sequence resides in 
an incoming transgene vector. In either case, the att site of the introduced vector 

25 would then preferentially recombine with the iiitegrated heterologous att site in the 
genome of the recipient cell. 

The methods of the invention are based, in part, on the discovery that there 
exist in avian genomes a number of specific nucleic acid sequences, termed pseudo- 
recombination sites, the sequences of which may be distinct from wild-type 

30 recombination sites but which can be recognized by a site-specific integrase and used 
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to promote the efficient insertion of heterologous genes or polynucleotides into the 
targeted avian nuclear genome. The inventors have identified pseudo-recombination 
sites in avian cells capable of recombining with a recombination site, such as an attB 
site within a recombinant nucleic acid molecule introduced into the target avian cell. 
5 The invention is also based on the prior integration of a heterologous att 
recombination site, typically isolated from a bacteriophage or a modification thereof, 
into the genome of the target avian cell. 

Integration into a predicted chromosomal site is useful to improve the 
predictability of expression, which is particularly advantageous when creating 

10 transgenic avians. Transgenesis by methods that result in insertion of the transgene 
into random positions of the avian genome is unpredictable since the transgene may 
not express at the expected levels or in the predicted tissues. 

The invention as disclosed herein, therefore, provides methods for site- 
specifilcally genetically transforming an avian nuclear genome. In general, an avian 

15 cell having a first recombination site in the nuclear genome is transformed with a site- 
specific polynucleotide construct comprising a second recombination sequence and 
one or more polynucleotides of interest. Into the same cell, integrase activity is 
introduced that specifically recognizes the first and second recombination sites under 
conditions such that the polynucleotide sequence of interest is inserted into the nuclear 

20 genome via an integrase-mediated recombination event between the first and second 
recombination sites. 

The integrase activity, or a source thereof, can be introduced into the avian cell 
prior to, or concurrent with, the introduction of the site-specific construct. The 
integrase can be delivered to a cell as a polypeptide, or by expressing the integrase 

25 from a source polynucleotide such as an mRNA or from an expression vector that 
encodes the integrase, either of which can be delivered to the target avian cell before, 
during or after delivery of the polynucleotide of interest. Any integrase that has 
activity in an avian cell may be usefiil in the present invention, including HK022 
(Kolot et al., Biotechnol. Bioeng., 84: 56-60 (2003)). Preferably, the integrase is a 

30 serine recombinase as described, for example, by Smith & Thorpe, in Mol. Microbiol, 
44: 299-307 (2002). More preferably, the integrase is a bacteriophage integrase such 
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as, but not limited to, TP901-1 (Stoll et al, J. Bad., 184: 3657-3663 (2002); Olivares 
et al.. Gene, 278:167-176 (2001), Most preferably, the integrase is from the phage 
phiC31. 

The nucleotide sequence of the junctions between an integrated transgene into 
5 the attP (or attB site) would be known. Thus, a PGR assay can be designed by one of 
skill in the art to detect when the integration event has occurred. The PGR assay for 
integration into a heterologous wild-type attB or attP site can also be readily 
incorporated into a quantitative PGR assay using Taqman™ or related technology so 
that the efficiency of integration can be measured. 
10 The minimal attB and attP sites able to catalyze recombination mediated by 

the phiCSl integrase are 34 and 39 bp, respectively. In cell lines that harbor a 
heterologous integrated attP site, however, integrase has a preference for the inserted 
attP over any pseudo-attP sites of similar length, because pseudo-attP sites have very 
low sequence identity (between 10 to 50% identity) compared to the more efficient 

15 wild-type attP sequence. It is within the scope of the methods of the invention, 
however, for the recombination site within the target avian genome to be a pseudo-att 
site such as a pseudo-attP site or an attP introduced into an avian genome. 

The sites used for recognition and recombination of phage and bacterial DNAs 
(the native host system) are generally non-identical, although they typically have a 

20 common core region of nucleic acids. The bacterial sequence is generally called the 
attB sequence (bacterial attachment) and the phage sequence is called the attP 
sequence (phage attachment). Because they are different sequences, recombination 
will result in a stretch of nucleic acids (called attL or attR for left and right) that is 
neither an attB sequence or an attP sequence, and likely is functionally unrecognizable 

25 as a recombination site to the relevant enzyme, thus removing the possibility that the 
en2yme will catalyze a second recombination reaction that would reverse the first. 

The integrase may recognize a recombination site where sequence of the 5' 
region of the recombination site can differ from the sequence of the 3' region of the 
recombination sequence. For example, for the phage phiC31 attP (the phage 

30 attachment site), the core region is 5'-TTG-3' the flanking sequences on either side are 
represented here as attP5' and attP3', the structure of the attP recombination site is. 
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accordingly, attP5'-TTG-attP3'. Correspondingly, for the native bacterial genomic 
target site (attB) the core region is 5'-TTG-3', and the flanking sequences on either 
side are represented here as attB5' and attB3', the structure of the attB recombination 
site is, accordingly, attB5'-TTG-attB3'. After a single-site, phiCBl integrase-mediated 
5 recombination event takes place between the phiCSl phage and the bacterial genome, 
the result is the following recombination product: attB5'-TTG-attP3'{phiC31 vector 
sequences}attP5'-TTG-attB3'. In the method of invention, the attB site will be within 
a recombinant nucleic acid molecule that may be delivered to a target avian cell. The 
corresponding attP (or pseudo-attP) site will be within the avian cell nuclear genome. 
10 Consequently, after phiC31 integrase mediated recombination, the recombmation 
product, the nuclear genome with the integrated heterologous polynucleotide will have 
the sequence attP5'-TTG-attB3'{heterologous polynucleotide}-attB5'-TTG-attP3'. 
Typically, after recombitiation the post-recombination recombination sites are no 
longer able to act as substrate for the phiCSl integrase. This results in stable 

1 5 integration with little or no integrase mediated excision. 

While the preferred recombination site to be included in the recombinant 
nucleic acid molecules and modified chromosomes of the present invention is the attP 
site, it is contemplated that any attP-like site may be used if compatible with the attB 
site. For instance, any pseudo-attP site of the chicken genome may be identified 

20 according to the methods of Example 7 below and used as a heterologous att 
recombination site. Such attP-like sites may have a sequence that is at least 25% 
identical to SEQ ID NO: 1 1 as shown in Fig. 19, such as described in Groth et al., 
Proc. Natl Acad. Sci. U.S.A. 97: 5995-6000 (2000) incorporated herein by reference 
in its entirety. Preferably the selected site will have at least the same degree of 

25 efficiency of recombination as the attP site (SEQ ID NO: 1 1) itself. 

In the methods of the present invention, the recipient avian cell population 
may be an isolated avian cell line such as, for example, DF-1 chicken fibroblasts, 
chicken DT40 cells or a cell population derived from an early stage embryo such as a 
chicken stage I or stage X embryo. A particularly usefiil avian cell population is 

30 blastodermal cells isolated fi-om a stage X avian embryo. The methods of the present 
invention, therefore, include steps for the isolation of blastodermal cells that are then 
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suspended in a cell culture medium or buffer for maintaining the cells in a viable state, 
and which allows the cell suspension to contact the nucleic acids of the present 
invention. It is also within the scope of the invention for the nucleic acid construct 
and the source of integrase activity to be delivered directly to an avian embryo such as 
5 a blastodermal layer, or to a tissue layer of an adult bird such as the lining of an 
oviduct. 

When the recipient avian cell population is isolated from an early stage avian 
embryo, the embryos must first be isolated. For stage I avian embryos from, for 
example, a chicken, a fertilized ovum is surgically removed from a bird before the 

10 deposition of the outer, hard shell has occurred. The nucleic acids for integrating a 
heterologous nucleic acid into a recipient avian cell genome may then be delivered to 
isolated embryos by lipofection, microinjection (as described in Example 6 below) or 
electroporation and the like. After delivery of the nucleic acid, the tiransfected embryo 
and its yolk may be deposited into the infiindibulum of a recipient hen for the 

1 5 deposition of egg white proteins and a hard shell, and laying of the egg. Stage X avian 
embryos are obtained from freshly laid fertilized eggs and the blastodermal cells 
isolated as a suspension of cells in a medium, as described in Example 4 below. 
Isolated stage X blastodermal cell populations, once transfected, may be injected into 
recipient stage X embryos and the hard shell eggs resealed according to the methods 

20 described in U.S. Patent No. 6,397,777. 

In the methods of the invention, once a heterologous nucleic acid is delivered 
to the recipient avian cell, the integrase activity is expressed. The expressed integrase 
(or injected integrase polypeptide) then mediates recombination between the att site of 
the heterologous nucleic acid molecule, and the att (or pseudo att) site within the 

25 genomic DNA of the recipient avian cell. 

It is within the scope of the present invention for the integrase-encoding 
sequence and a promoter operably linked thereto to be included in the delivered 
nucleic acid molecule and that expression of the integrase activity occurs before 
integration of the heterologous nucleic acid into the avian cell genome. Preferably, 

30 the integrase-encoding nucleic acid sequence and associated promoter are in an 
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expression vector that may be co-delivered to the recipient avian cell with the 
heterologous nucleic acid molecule to be integrated into the recipient genome. 

One suitable integrase expressing expression vector for use in the present 
invention is pCMV-C31int (SEQ ID NO: 1) as shown in Fig. 9, and described in 
5 Groth et al., Proc. Natl. Acad. Sci. U.S.A. 97: 5995-6000 (2000), incorporated herein 
by reference in its entirety. In pCMV-C31int, expression of the integrase-encoding 
sequence is driven by the CMV promoter. However, any promoter may be used that 
will give expression of the integrase in a recipient avian cell, including operably 
linked avian-specific gene expression control regions of the avian ovalbumin, 

10 lysozyme, ovomucin, ovomucoid gene loci, viral gene promoters, inducible 
promoters, the RSV promoter and the like. 

The recombinant nucleic acid molecules of the present invention for delivery 
of a heterologous polynucleotide to the genome of a recipient avian cell may comprise 
a nucleotide sequence encoding the attB attachment Site of Streptomyces ambofaciens 

15 as described in Thorpe & Smith, Proc. NcUl Acad. Sci. U.S.A. 95: 5505-5510 (1998). 
The nucleic acid molecule of the present invention further comprises an expression 
cassette for the expression in a recipient avian cell of a heterologous nucleic acid 
encoding a desired heterologous polypeptide. Optionally, the nucleic acid molecules 
may further comprise a marker such as, but not limited to, a puromycin resistance 

20 gene, a luciferase gene, EGFP, and the like. 

It is contemplated that the expression cassette for introducing a desired 
heterologous polypeptide comprises a promoter operably linked to a nucleic acid 
encoding the desired polypeptide and, optionally, a polyadenylation signal sequence. 
Exemplary nucleic acids suitable for use in the present invention are more fiilly 

25 described in the examples below. 

In the methods of the present invention, following delivery of the nucleic acid 
molecule and a source of integrase activity into an avian cell population, the cells are 
maintained under culture conditions suitable for the expression of the integrase and/or 
for the integrase to mediate recombination between the recombination site of the 

30 nucleic acid and recombination site in the genome of the recipient avian cell. When 
the recipient avian cell is cultured in vitro, such cells may be incubated at 37° Celsius 
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if the cells are chicken early stage blastodermal cells. They may then be injected into 
an embryo within a hard shell, which is resealed for incubation until hatching. 
Alternatively, the transfected cells may be maintained in in vitro culture. 

5 Site-Specific Nucleic Acid Constructs and Methods of Delivery to an Avian Cell 

The present invention provides methods for the site-specific insertion of a 
heterologous nucleic acid molecule into the nuclear genome of an avian cell by 
delivering to a target avian cell that has a recombination site in its nuclear genome, a 
source of integrase activity, a site-specific construct that has another recombination 
10 site and a polynucleotide of interest, and allowing the integrase activity to facilitate a 
recombination event between the two recombination sites, thereby integrating the 
polynucleotide of interest into the avian nuclear genome. 

(a) Expression vector nucleic acid molecules: A variety of recombinant nucleic acid 
expression vectors are suitable for use in the practice of the present invention. The 

15 site-specific constructs described herein can be constructed utilizing methodologies 
well known in the art of molecular biology (see, for example, Ausubd or Maniatis) in 
view of the teachings of the specification. As described above, the constructs are 
assembled by inserting into a suitable vector backbone a recombination site such as an 
attP or an attB site, a polynucleotide of interest operably linked to a gene expression 

20 control region of interest and, optionally a sequence encoding a positive selection 
marker. Polynucleotides of interest can include, but are not limited to, expression 
cassettes encoding a polypeptide to be expressed in the transformed avian cell or in a 
transgenic bird derived therefi-om. The site-specific constructs are typically circular 
and may also contain selectable markers, an origin of replication, and other elements. 

25 Any of the vectors of the present invention may also optionally include a 

sequence encoding a signal peptide that directs secretion of the polypeptide expressed 
by the vector from the transgenic cells, for instance, from tubular gland cells of the 
oviduct. This aspect of the invention effectively broadens the spectrum of exogenous 
proteins that may be deposited in the whites of avian eggs using the methods of the 

30 invention. Where an exogenous polypeptide would not otherwise be secreted, the 
vector bearing the coding sequence can be modified to comprise, for instance, about 
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60 bp encoding a signal peptide. The DNA sequence encoding tlie signal peptide is 
inserted in the vector such that the signal peptide is located at the N-terminus of the 
polypeptide encoded by the vector. 

The expression vectors of the present invention can comprise an avian 
5 transcriptional regulatory region for directing expression of either fiision or non-fusion 
proteins. With fusion vectors, a number of amino acids are usually added to the 
desired expressed target gene sequence such as, but not limited to, a polypeptide 
sequence for thioredoxin. A proteolytic cleavage site may fiirther be introduced at a 
site between the target recombinant protein and the fusion sequence. Additionally, a 

10 region of amino acids such as a polymeric histidine region may be introduced to allow 
binding of the fusion protein to metallic ions such as nickel bonded to a solid support, 
for purification of the fusion protein. Once the fusion protein has been purified, the 
cleavage site allows the target recombinant protein to be separated from the fusion 
sequence. Enzymes suitable for use in cleaving the proteolytic cleavage site include, 

15 but are not limited to. Factor Xa and thrombin. Fusion expression vectors that may be 
useful in the present invention include pGex (Amrad Corp., Melbourne, Australia), 
pRlT5 (Pharmacia, Piscataway, NJ) and pMAL (New England Biolabs, Beverly, MA), 
that fiise glutathione S-transferase, protein A, or maltose E binding protein, 
respectively, to a desired target recombinant protein. 
20 Epitope tags are short peptide sequences that are recognized by epitope 

specific antibodies. A fusion protein comprising a recombinant protein and an epitope 
tag can be simply and easily purified using an antibody bound to a chromatography 
resin, for example. The presence of the epitope tag furthermore allows the 
recombinant protein to be detected in subsequent assays, such as Western blots, 
25 without having to produce an antibody specific for the recombinant protein itself 
Examples of commonly used epitope tags include V5, glutathione-S-transferase 
(GST), hemaglutinin (HA), the peptide Phe-His-His-Thr-Thr, chitin binding domain, 
and the like. 

Preferred gene expression control regions for use in avian cells include, but are 
30 not limited to, avian specific promoters such as the chicken lysozyme, ovalbumin, or 
ovomucoid promoters, and the like. Particularly useful are tissue-specific promoters 
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such as avian oviduct promoters that allow for expression and delivery of a 
heterologous polypeptide to an egg white. 

Viral promoters serve the same flinction as bacterial or eukaryotic promoters 
and either provide a specific RNA polymerase in trans (bacteriophage T7) or recruit 
5 cellular factors and RNA polymerase (SV40, RSV, CMV), Viral promoters may be 
preferred as they are generally particularly strong promoters. A preferred promoter for 
use in avian cells is the RSV promoter. 

Selection markers are valuable elements in expression vectors as they provide 
a means to select for growth of only those cells that contain a vector. Common 
10 selectable marker genes include those for resistance to antibiotics such as ampicillin, 
puromycin, tetracycline, kanamycin, bleomycin, streptomycin, hygromycin, neomycin, 
ZEOCIN™, and the like. 

Another element useful in an expression vector is an origin of replication. 

Replication origins are unique DNA segments that contain multiple short repeated 
15 sequences that are recognized by multimeric origin-binding proteins and that play a 

key role in assembling DNA replication enzymes at the origin site. Suitable origins of 

replication for use in expression vectors employed herein include E. coli oriC, colEl 

plasm id origin, and the like. 

A further useful element in an expression vector is a multiple cloning site or 
20 polylinker. Synthetic DNA encoding a series of restriction endonuclease recognition 

sites is inserted into a vector, for example, downstream of the promoter element. 

These sites are engineered for convenient cloning of DNA into the vector at a specific 

position. 

Elements such as the foregoing can be combined to produce expression vectors 
25 suitable for use in the methods of the invention. Those of skill in the art will be able 
to select and combine the elements suitable for use in their particular system in view 
of the teachings of the present specification. 

(b) Genetically modified avian and artificial chromosomes: The present invention 
further provides modified chromosomes, either isolated avian or artificial 
30 chromosomes, are useful vectors to shuttle transgenes or gene clusters into the avian 
genome. By delivering the modified or artificial chromosome to an isolated recipient 
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cell, the target cell, and progeny thereof, become trisomic. Preferably, an additional or 
triosomic chromosome will not affect the subsequent development of the recipient cell 
and/or an erabtyo, nor interfere with the reproductive capacity of an adult bird 
developed from such cells or embryos. The chromosome also should be stable within 
5 chicken cells. An effective method is also required to isolate a population of 
chromosomes for delivery into chicken embryos or early cells. 

A number of artificial chromosomes are useful in the methods of the 
invention, including, for instance, a human chromosome modified to work as an 
artificial chromosome in a heterologous species as described, for example, for mice 

10 (Tomizuka et cd.. Proc. Natl. Acad. Sci. U.S.A. 97: 722-727 (2000); for cattle 
(Kuroiwa et al, Nat. Biotechnol. 20: 889-894 (2002); a mammalian artificial 
chromosome used in mice (Co et al.. Chromosome Res. 8: 183-191 (2000). 

Chickens that are trisomic for microchromosome 16 have been described 
(Miller et al., Proc. Natl. Acad. Sci. U.S.A. 93: 3958-3962 (1996); Muscarella et al, J. 

15 Cell Biol. 101: 1749-1756 (1985). In these cases, triploidy and trisomy occurred 
naturally, and illustrate that an extra copy of one or more of the chicken chromosomes 
is compatible with normal development and reproductive capacity. 

A useful chromosome isolation protocol can comprise the steps of inserting a 
lac-operator sequence (Robinett et al. J. Cell Biol 135: 1685-1700 (1996) into an 

20 isolated chromosome and, optionally, inserting a desired transgene sequence within 
the same chromosome. Preferably, the lac operator region is a concatamer of a 
plurality of lac operators for the binding of multiple lac repressor molecules. Insertion 
can be accomplished, for instance, by identifying a region of known nucleotide 
sequence associated with a particular avian chromosome. A recombinant DNA 

25 molecule may be constructed that comprises the identified region, a recombination 
site such as attB or attP and a lac-operator concatamer. The recombinant molecule is 
delivered to an isolated avian cell, preferably, but not limited to, chicken DT40 cells 
that have elevated homologous recombination activity compared to other avian cell 
lines, whereupon homologous recombination will integrate the heterologous 

30 recombination site and the lac-operator concatamer into the targeted chromosome as 
shown in the schema illustrated in Fig. 20. A tag-polypeptide comprising a label 
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domain and a lac repressor domain is also delivered to the cell, preferably by 
expression from a suitable expression vector. The nucleotide sequence coding for a 
GFP-lac-repressor fusion protein (Robinett et al, J. Cell Biol. 135: 1685-1700 (1996)) 
may be inserted into the same chromosome as the lac-operator insert. The lac 
5 repressor sequence, however, can also be within a different chromosome. An 
inducible promoter may also be used to allow the expression of the GFP-lac-repressor 
only after chromosome is to be isolated. 

Induced expression of the GPF-lac-repressor fusion protein will result in 
specific binding of the tag fusion polypeptide to the lac-operator sequence for 
10 identification and isolation of the genetically modified chromosome. The tagged 
mitotic chromosome can be isolated using, for instance, flow cytometry as described 
in de Jong et al. Cytometry 35: 129-133 (1999) and Griffin et al Cytogenet. Cell 
Genet. 87: 278-281 (1999). 

A tagged chromosome can also be isolated using microcell technology 
15 requiring treatment of cells with the mitotic inhibitor colcemid to induce the 
formation of micronuclei containing intact isolated chromosomes within the cell. 
Final separation of the micronuclei is then accomplished by centrifugation in 
cytochalasin as described by Killary & Foumier in Methods Enzymol. 254: 133-152 
(1995). Further purification of microcells containing only the desired tagged 
20 chromosome could be done by flow cytometry. It is contemplated, however, that 
alternative methods to isolate the mitotic chromosomes or microcells, including 
mechanical isolation or the use of laser scissors and tweezers, and the like. 

Delivery of a Site-Specific Nucleic Acid to a Recipient Avian Cell orEmbrvo. 

25 (a) Delivety of polynucleotide constructs. 

Most non-viral methods of gene transfer rely on normal mechanisms used by 
eukaryotic cells for the uptake and intracellular transport of macromolecules. In 
preferred embodiments, non-viral gene delivery systems of the present invention rely 
on endocytic pathways for the uptake of the subject transcriptional regulatory region 

30 and operably linked polypeptide-encoding nucleic acid by the targeted cell. 
Exemplary gene delivery systems of this type include liposomal derived systems, 
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poly-lysine conjugates, and artificial viral envelopes. Modified chromosomes as 
described above may be delivered to isolated avian embryonic cells for subsequent 
introduction to an embryo. 

In a representative embodiment, a nucleic acid molecule can be entrapped in 
5 liposomes bearing positive charges on their surface (e.g., lipofectins) and (optionally) 
which are tagged with antibodies against cell surface antigens of the target tissue 
(Mizuno et cd., 1992, NO Shinkei Geka 20: 547-551; PCT publication WO91/06309; 
Japanese patent application 1047381; and European patent publication EP-A-43075, 
all of which are incorporated herein by reference in their entireties). 

10 In similar fashion, the gene delivery system can comprise an antibody or cell 

surface ligand that is cross-linked with a gene binding agent such as polylysine (see, 
for example, PCT publications WO93/04701, W092/22635, WO92/20316, 
W092/19749, and WO92/06180, all of which are incorporated herein by reference in 
their entireties). It will also be appreciated that effective delivery of the subject 

15 nucleic acid constructs via receptor-mediated endocytosis can be improved using 
agents which enhance escape of genes from the endosomal structures. For instance, 
whole adenovirus or fiisogenic peptides of the influenza HA gene product can be used 
as part of the delivery system to induce efficient disruption of DNA-containing 
endosomes (Mulligan et al, 1993, Science 260-926; Wagner et al, 1992, Proc. Natl 

20 Acad. Sci. 89:7934-7938; and Christiano et al, 1993, Proc. Natl Acad. Sci. 90:2122- 
2126, all of which are incorporated herein by reference in their entireties). It is further 
contemplated that a recombinant nucleic acid molecule of the present invention may 
be delivered to a target host cell by other non-viral methods including by gene gun, 
microinjection, sperm-mediated transfer, or the like. 

25 In yet another embodiment of the invention, an expression vector that 

comprises a heterologous attB recombination site and a region encoding a polypeptide 
deposited into an egg white are delivered to oviduct cells by in vivo electroporation. 
In this method, the luminal surface of an avian oviduct is surgically exposed. A 
buffered solution of the expression vector and a source of integrase activity such as a 

30 second expression vector expressing integrase (for example pCMV-int) is deposited 
on the luminal surface. Electroporation electrodes are then positioned on either side 
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of the oviduct wall, the luminal electrode contacting the expression vector solution. 

After electroporation, the surgical incisions are closed. The electroporation will 

deliver the expression vectors to some, if not all, treated recipient oviduct cells to 

create a tissue-specific chimeric animal. Expression of the integrase allows for the 
5 integration of the heterologous polynucleotide into the genome of recipient oviduct 

cells. While this method may be used with any bird, a preferred recipient is a chicken 

due to the size of the oviduct. More preferred is a transgenic bird that has a transgenic 

attP recombinant site in the nuclear genomes of recipient oviduct cells, thus increasing 

the efficiency of integration of the expression vector. 
1 0 The attB/P integrase system is preferred in the in vivo electroporation method 

to allow the formation of stable genetically transformed oviduct cells that otherwise 

progressively lose the heterologous expression vector. 

The stably modified oviduct cells will express the heterologous polynucleotide 

and deposit the resulting polypeptide into the egg white of a laid egg. For this 
15 purpose, the expression vector will fiirther comprise an oviduct-specific promoter 

such as ovalbumin or ovomucoid operably linked to the desired heterologous 

polynucleotide. 

(b) Delivery of chromosomes to avian cells. 

Another aspect of the invention is the generation of a trisomic avian cell 
20 comprising a genetically modified extra chromosome. The extra chromosome may be 
an artificial chromosome or an isolated avian chromosome that has been genetically 
modified. Introduction of the extra chromosome to an avian cell will generate a 
trisomic cell with 2n+l chromosomes, where n is the haploid number of chromosomes 
of a normal avian cell. 

25 Delivery of an isolated chromosome into an isolated avian cell or embryo can 

be accomplished in several ways. Isolated mitotic chromosomes or a micronucleus 
containing an interphase chromosome can be injected into early stage I embryos by 
cytoplasmic injection. The injected zygote would then be surgically transferred to a 
recipient hen for the production and laying of a hard shell egg. This hard shell egg 

30 would then be incubated until hatching of a chick. 

Isolated microcells can be fiised to primordial germ cells (PGCs) isolated from 



-34- 



wo 2004/092351 



PCT/US2004/009253 



the blood stream of late stage 15 embryos as described by Killary & Foumier in 
Methods Enzymol. 254: 133-152 (1995). The PGC/microcell hybrids can then be 
transplanted into the blood stream of a recipient embryo to produce germline chimeric 
chickens. (See Naito M?/. Reprod. Dev. 39: 153-161 (1994)). The manipulated 
5 eggs would then incubated until hatching of the bird. 

Blastodermal cells isolated from stage X embiyos can be transfected with 
isolated mitotic chromosomes. Following in vitro transfection, the cells are 
transplanted back into stage X embryos as described, for example, in Etches et al.. 
Poult. Sci., 72: 882-829 (1993), and the manipulated eggs are incubated to hatching. 
1 0 Stage X blastodermal cells can also be fused with isolated microcells and then 

transplanted back into to stage X embiyos or fused to somatic cells to be used as 
nuclear donors for nuclear transfer as described by Kuroiwa et al., Nat. Biotechnol. 
20: 889-894 (2002). 

Chromosomal vectors, as described above, may be delivered to a recipient 

1 5 avian cell by, for example, microinjection, liposomal delivery or microcell fusion. 

In the methods of the invention, a site-specific integrase is introduced into an 
avian cell whose genome is to be modified. Methods of introducing functional 
proteins into cells are well known in the art. Introduction of purified integrase protein 
can ensure a transient presence of the protein and its activity. Thus, the lack of 

20 permanence associated with most expression vectors is not expected to be detrimental. 

The integrase used in the practice of the present invention can be introduced 
into a target cell before, concurrently with, or after the introduction of a site-specific 
vector. The integrase can be directly introduced into a cell as a protein, for example, 
by using liposomes, coated particles, or microinjection, or into the blastodermal layer 

25 of an early stage avian embryo by microinjection. A source of the integrase can also 
be delivered to an avian cell by introducing to the cell an mRNA encoding the 
integrase and which can be expressed in the recipient cell as an integrase polypeptide. 
Alternately, a DNA molecule encoding the integrase can be introduced into the cell 
using a suitable expression vector. 

30 The present invention provides novel nucleic acid vectors and methods of use 

that allow the phiC3 1 integrase to efficiently integrate a heterologous nucleic acid into 
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an avian genome. A novel finding is that the phiC3 1 integrase is remarkably efficient 
in avian cells and increases the rate of integration of heterologous nucleic acid at least 
30-fold over that of random integration. Furthermore, the phiCBl integrase works 
equally well at 37°C and 4rc, indicating that it will function in the environment of 
5 the developing avian embryo, as shown in Example 1 . 

It is important to note that the present invention is not bound by any 
mechanism or theory of operation. For example, the mechanism by which integrase, 
or any other substance described herein, facilitates transgenesis is unimportant. 
Integrase, for example, may fecilitate transgenesis by mediating the integration of 
10 DNA into the genome of a recipient cell or integrase may facilitate transgenesis by 
facilitating the entry of the DNA into the cell or integrase may facilitate transgenesis 
by some other mechanism. 

The site-specific vector components described above are useful in the 
construction of expression cassettes containing sequences encoding an integrase. One 
15 integrase-expressing vector useful in the methods of the invention is pCMV-C31int 
(SEQ ID NO: 1 as shown in Fig. 9) where the phiC31 integrase is encoded by a region 
under the expression control of the strong CMV promoter. Another preferred 
promoter generally useful in avian cells is the RSV promoter as used in SEQ ID NO: 9 
shown in Fig. 17. Expression of the integrase is typically desired to be transient. 
20 Accordingly, vectors providing transient expression of the integrase are preferred. 
However, expression of the integrase can be regulated in other ways, for example, by 
placing the expression of the integrase under the control of a regulatable promoter 
(i.e., a promoter whose expression can be selectively induced or repressed). 

Delivery of the nucleic acids introduced into avian cells, for example, 
25 embryonic avian cells,, using methods of the invention may also be enhanced by 
mixing the nucleic acid to be introduced with a nuclear localization signal (NLS) 
peptide prior to introduction, for example, microinjection, of the nucleic acid. 
Nuclear localization signal (NLS) sequences are a class of short amino acid sequences 
which may be exploited for cellular import of linked cargo into a nucleus. The 
30 present invention envisions the use of any useful NLS peptide, including but not 
limited to, the NLS peptide of SV40 virus T-antigen. 
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An NLS of the invention is an amino acid sequence which mediates nuclear 
transport into the nucleus, wherein deletion of the NLS reduces transport into the 
nucleus. In certain embodiments, an NLS is a cationic peptide, for example, a highly 
cationic peptide. The present invention includes the use of any NLS sequence, 
5 including but not limited to, SV40 virus T-antigen. NLSs known in the art include, 
but are not limited to those discussed in Cokpl et al, 2000, EMBO Reports, 1(5):411- 
415, Boulikas, T., 1993, Crit. Rev. Eukaryot. Gene Expr., 3:193-227, CoUas, P. et al, 
1996, Transgenic Research, 5: 451-458, Collas and Alestrom, 1997, Biochem. Cell 
Biol. 75: 633-640, Collas and Alestrom, 1998, Transgenic Research, 7: 303-309, 
10 Collas and Alestrom, Mol Reprod. Devel, 1996, 45:431-438. The disclosure of each 
of these references is incorporated by reference herein in its entirety. 

Not to be bound by any mechanism of operation, DNA is protected and hence 
stabilized by cationic polymers. The stability of DNA molecules in the cytoplasm of 
cells may be increased by mixing the DNA to be introduced, for example, 
1 5 microinjected with cationic polymers (for example, branched cationic polymers), such 
as polyethylenimine (PEI), polylysine, DEAE-dextran, starburst dendrimers, starburst 
polyamidoamine dendrimers, and other materials that package and condense the DNA 
molecules (Kukowska-Latallo et al., 1996, Proc. Natl. Acad. Sci. USA 93:4897-4902). 
Once the DNA molecules are delivered to the cytoplasm of cells, they migrate 
20 into the cell's endocytotic vesicles. Furthermore, migration into the cell's endosome 
is followed by fast inactivation of DNA within the endolysosomal compartment in 
transfected or injected cells, both in vitro and in vivo (Godbey, W, et al. 1999, Proc 
Natl Acad Sci USA 96: 5177-81; and Lechardeur, D, et al. 1999, Gene Ther 6: 482- 
97; and references cited therein). Accordingly, in certain embodiments, DNA uptake 
25 is enhanced by the receptor-mediated endocytosis pathway using transferrin- 
polylysine conjugates or adenoviral-mediated vesicle disruption to effect the release of 
DNA from endosomes. However, the invention is not limited to this or any other 
theory or mechanism of operation referred to herein. 

Buffering the endosomal pH using endosoraal-scaping elements also protects 
30 DNA from degradation (Kircheis, R, et al. 2001, Adv Drug Deliv Rev 53: 341-58 ; 
Boussif, O. et al. 1995, Proc Natl Acad Sci USA 92: 7297-301; and Pollard, H, et al. 
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1998, J Biol Chem 273: 7507-11; and references cited therein). Thus, in certain 
embodiments, DNA complexes are delivered with polycations or cationic polymers 
that possess substantial buffering capacity below physiological pH, such as 
polyethylenimine, lipopolyamines and polyamidoamine polymers. In certain 
5 embodiments, DNA condensing compounds, such as the ones described above, are 
combined with viruses (Curiel, D, et al. Proc Natl Acad Sci USA8S: 8850-4, 1991; 
Wagner, E, etal. Proc Natl Acad Sci USA 89: 6099-103, 1992 and Gotten, M, etal., 
1992, Proc Natl Acad Sci US A 89: 6094-8), viral peptides (Wagner, E, et al. 1992, 
Proc Natl Acad Sci US A 89: 7934-8; Plank, C, et al. 1994, J Biol Chem 269: 12918- 
10 24) and subunits of toxins (Uherek, C, et al., 1998, J Biol Chem 273: 8835-48). These 
materials significantly enhance the release of DNA from endosomes. In certain 
embodiments, viruses, viral peptides, toxins or subunits of toxins may be coupled to 
DNA/polylysine complexes via biochemical means or specifically by a streptavidin- 
biotin bridge (Wagner et al, 1992, Proc. Natl. Acad. Sci. USA 89:6099-6103; Plank et 

15 al., 1994, J. Biol Chem. 269(17):12918-12924). hi other certain embodiments, the 
virus that is complexed with the DNA may be adenovirus, retrovirus, vaccinia virus, 
or parvovirus. The viruses may be linked to PEI or another cationic polymer 
associated with the nucleic acid. In certain embodiments, the virus may be alphavirus, 
orthomyxovirus, or picomavirus. In certain embodiments, the virus is defective or 

20 chemically inactivated. The vims may be inactivated by short-wave UV radiation or 
the DNA intercalator psoralen plus long-wave UV. The adenovirus may coupled to 
polylysine, either enzymatically through the action of transglutaminase or 
biochemically by biotinylating adenovirus and streptavidinylating the polylysine 
moiety. Transferrin may also be useflil in combination with cationic polymers, 

25 adenoviruses and/or other materials disclosed herein to produce transgenic avians. 
For example, DNA complexes containing PEI, PEI-modified transferrin, and PEI- 
bound influenza peptides may be used to enhance transgenic avian production. 

In other certain embodiments, complexes containing plasmid DNA, 
transferrin-PEI conjugates, and PEI-conjugated peptides derived from the N-terminal 

30 sequence of the influenza virus hemagglutinin subunit HA-2 may be used to produce 
transgenic chickens. In certain embodiments, the PEI-conjugated peptide may be a an 
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amino-terminal amino acid sequence of influenza virus hemagglutinin which may be 
elongated by an amphipathic helix or by carboxyl-terminal dimerization. 

The present invention provides for methods of dispersing or distributing 
nucleic acid in a cell, for example, in an avian cell. The avian cell may be, for 
5 example, and without limitation, a cell of a stage I avian embryo, a cell of a stage II 
avian embryo, a cell of a stage ffl avian embryo, a cell of a stage IV avian embryo, a 
cell of a stage V avian embryo, a cell of a stage VI avian embryo, a cell of a stage VII 
avian embryo, a cell of a stage VIII avian embryo, a cell of a stage IX avian embryo, a 
cell of a stage X avian embryo, a cell of a stage XI avian embryo or a cell of a stage 
10 Xn avian embryo. In one particularly useful embodiment, the avian cell is a cell of a 
stage X avian embryo. 

In one aspect of the present invention, cationic polymers are usefiil to 
distribute, for example, homogeneously distribute, nucleic acid introduced into a cell, 
for example, an embryonic avian cell. The present invention contemplates the use of 
1 5 cationic polymers including, but not limited to, those disclosed herein. 

However, substances other than cationic polymers also capable of distributing 
or dispersing nucleic acids in a cell are included within the scope of the present 
invention. 

The concentration of cationic polymer used is not critical though, preferably, 
20 enough cationic polymer is present to coat the nucleic acid to be introduced into the 
avian cell. The cationic polymer may be present in an aqueous mixture with the 
nucleic acid to be introduced into the cell at a concentration in a range of an amount 
equal to about the weight of the nucleic acid to a concentration wherein the solution is 
saturated with cationic polymer. In one useful embodiment, the cationic polymer is 
25 present in an amount in a range of about 0.01% to about 50 %, for example, about 
0.1% to about 20% (e.g., about 5%). The molecular weights of the cationic polymers 
can range from a molecular weight of about 1,000 to a molecular weight of about 
1,000,000. In one embodiment, the molecular weight of the cationic polymers range 
from about 5,000 to about 100,000 for example, about 20,000 to about 30,000. 
30 In one particularly useful aspect of the invention, procedures that are effective 

to facilitate the production of a transgenic avian may be combined to provide for an 
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enhanced production of a transgenic avian wherein the enhanced production is an 
improved production of a transgenic avian relative to the production of a transgenic 
avian by only one of the procedures employed in the combination. For example, one 
or more of integrase activity, NLS, cationic polymer or other technique useful to 
5 enhance transgenic avian production disclosed herein can be used in the same 
procedure to provide for an enhanced production of transgenic avians relative to an 
identical procedure which does not employ all of the same techniques useful to 
enhance transgenic avian production. 

10 Transgenic Avian Cells. 

Another aspect of the present invention is an avian cell genetically modified 
w^ith a transgene vector according to the present invention and described above. For 
example, in one embodiment, the transformed cell can be a chicken early stage 
blastodermal cell or a genetically transformed cell line, including a sustainable cell 

15 line. The transfected cell according to the present invention may comprise a transgene 
stably integrated into the nuclear genome of the recipient cell, thereby replicating with 
the cell so that each progeny cell receives a copy of the transfected nucleic acid. A 
particularly useful cell line for the delivery and integration of a transgene comprises a 
heterologous attP site that can increase the efficiency of integration of a 

20 polynucleotide by phiC31 integrase and, optionally, a region for expressing the 
integrase. 

A retroviral vector can be used to deliver the att site into the avian genome 
since an attP or attB site is less than 300 bp. For example, the attP site can be inserted 
into the NLB retroviral vector, which is based on the avian leukosis virus genome. A 
23 lentiviral vector is a particularly suitable vector because lentiviral vectors can 
transduce non-dividing cells, so that a higher percentage of cells will have an 
integrated attP site. 

The lacZ region of NLB is replaced by the attP sequence. A producer cell line 
would be created by transformation of, for example, the Isolde cell line capable of 
30 producing a packaged recombinant NLB-attP virus pseudo-typed with the envA 
envelope protein. Supernatant from the Isolde NLB-attP line is concentrated by 
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centrifugation to produce high titer preparations of the retroviral vector that can then 
be used to deliver the attP site to the genome of an avian cell, as described in Example 
9 below. 

An attP-containing line of transgenic birds are a source of attP transgenic 
5 embryos and embryonic cells. Fertile zygotes and oocytes bearing a heterologous attP 
site in either the maternal, paternal, or both, genomes can be used for transgenic 
insertion of a desired heterologous polynucleotide. A transgene vector bearing an attB 
site, for example, would be injected into the cytoplasm along with either an integrase 
expression plasmid, mRNA encoding the integrase or the purified integrase protein, 
10 The oocyte or zygote is then cultured to hatch by ex ovo methods or reintroduced into 
a recipient hen such that the hen lays a hard shell egg the next day containing the 
injected egg. 

In another example, fertile stage VII-XII embryos hemizygous or homozygous 
for the heterologous attP sequence, are used as a source of blastodermal cells. The 

15 cells are harvested and then transfected with a transgene vector bearing an attB site 
along with a source of integrase. The transfected cells are then injected into the 
subgerminal cavity of windowed fertile eggs. The chicks that hatch will bear the 
transgene integrated into the attP site in a percentage of their somatic and germ cells. 
To obtain fiiUy transgenic birds, chicks are raised to sexiial maturity and those that are 

20 positive for the transgene in their semen are bred to non-transgenic mates. 

hi various embodiments, the genetically engineered cells of the invention may 
contain an integrase specifically recognizing recombination sites and which is 
introduced into genetically engineered cells containing a nucleic acid construct of the 
invention under conditions such that the nucleic acid sequence(s) of interest will be 

25 inserted into the nuclear genome. Methods for introducing such an integrase into a 
cell are described above. 

In some embodiments, the site-specific integrase is introduced into the cell as a 
polypeptide. In alternative embodiments, the site-specific integrase is introduced into 
the transgenic cell as a polynucleotide encoding the integrase, such as an expression 

30 cassette optionally carried on a transient expression vector, and comprising a 
polynucleotide encoding the recombinase. 



-41 - 



wo 2004/092351 



PCT/US2004/009253 



In one embodiment, the invention is directed to methods of using a vector for 
site-specific integration of a heterologous nucleotide sequence into the genome of an 
avian cell, the vector comprising a circular backbone vector, a polynucleotide of 
interest operably linked to a promoter, and a first recombination site, wherein the 
5 genome of the cell comprises a second recombination site and recombination between 
the first and second recombination sites is facilitated by phiC31 integrase. In certain 
embodiments, the integrase facilitates recombination between a bacterial genomic 
recombination site (attB) and a phage genomic recombination site (attP). 

In another embodiment, the invention is directed to an avian cell having a 

10 transformed genome comprising an integrated heterologous polynucleotide of interest 
whose integration, mediated by phiC31 integrase, was into a recombination site native 
to the avian cell genome and the integration created a recombination-product site 
comprising the polynucleotide sequence. In yet another embodiment, integration of 
the polynucleotide was into a recombination site not native to the avian cell genome, 

15 but instead into a heterologous recombination site engineered into the avian cell 
genome. 

In further embodiments, the invention is directed to transgenic birds 
comprising a modified cell and progeny thereof as described above, as well as 
methods of producing the same. 

20 Cells genetically modified to carry a heterologous attB or attP site by the 

methods of the present invention can be maintained under conditions that, for 
example, keep them alive but do not promote growth, promote growth of the cells, 
and/or cause the cells to differentiate or dedifferentiate. Cell culture conditions may 
be permissive for the action of the integrase in the cells, although regulation of the 

25 activity of the integrase may also be modulated by culture conditions (e.g., raising or 
lowering the temperature at which the cells are cultured). 

One aspect of the invention is a method for generating a genetically modified 
avian cell, and progeny thereof, using a tagged chromosome, the method comprising 
the steps of providing an isolated modified chromosome comprising a lac operator 

30 region and a first recombination site, delivering the modified chromosome to a avian 
cell, thereby generating a trisomic avian cell, delivering to the avian cell a source of a 
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tagged polypeptide comprising a fluorescent domain and a lac repressor domain, 
delivering a source of integrase activity to the avian cell, delivering a polynucleotide 
comprising a second recombination site and a region encoding a polypeptide to the 
avian cell, maintaining the avian cell under conditions suitable for the integrase to 
5 mediate recombination between the first and second recombination sites, thereby 
integrating the polynucleotide into the modified chromosome and generating a 
genetically modified avian cell, expressing the tag polypeptide by the avian cell, 
allowing the tag polypeptide to bind to the modified chromosome so as to label the 
modified chromosome, and isolating the modified chromosome by selecting modified 
1 0 chromosomes having a tag polypeptide bound thereto. 

In one embodiment of the invention, the second avian cell is selected from the 
group consisting of a stage VH-XII blastodermal cell, a stage I embryo, a stage X 
embryo; an isolated primordial genn cell, an isolated non-embryonic cell, and an 
oviduct cell. 

15 In various embodiments, the isolated modified chromosome is an avian 

chromosome or an artificial chromosome. 

In other embodiments of the invention, the step of providing an isolated 

modified chromosome comprising a lac operator region and a first recombination site 

comprises the steps of generating a trisomic avian cell by delivering to an isolated 
20 avian cell an isolated chromosome and a polynucleotide comprising a lac operator and 

a second recombination site, maintaining the trisomic cell under conditions whereby 

the heterologous polynucleotide is integrated into the chromosome by homologous 

recombination, delivering to the avian cell a source of a tag polj^eptide to label the 

chromosome, and isolating the labeled chromosome. 
25 In one embodiment of the invention, the lac operator region is a concatamer of 

lac operators. In other embodiments of the invention, the tag polypeptide is expressed 

from an expression vector. 

In one embodiment of the invention, the tag polypeptide is microinjected into 

the cell. In various embodiments of the invention, the method of delivery of a 
30 chromosome to an avian cell is selected irom the group consisting of liposome 
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delivery, microinjection, microcell, electroporation and gene gun delivery, or a 
combination thereof. 

In embodiments of the invention, the fluorescent domain of the tag 
polypeptide is GFP. 

5 In another embodiment of the invention, the method further comprises the 

step of delivering the second avian cell to an avian embryo. The embryo may be 
maintained under conditions suitable for hatching as a chick. 

In one embodiment of the invention, the second avian cell is maintained under 
conditions suitable for the proliferation of the cell, and progeny thereof. 

10 In various embodiments of tiie invention, the source of integrase activity is 

delivered to a first avian cell as a polypeptide or expressed from a polynucleotide, said 
polynucleotide being selected from an mRNA and an expression vector. 

In one embodiment of the invention, the tag polypeptide activity is delivered 
to the avian cell as a polypeptide or expressed from a polynucleotide operably linked 

15 to a promoter. In another embodiment of the invention, the promoter is an inducible 
promoter. In yet another embodiment of the invention, the integrase is phiCSl 
integrase and in various embodiments of the invention, the first and second 
recombination sites are selected from an attB and an attP site, but wherein the first and 
second sites are not identical. 

20 

Expression of Heterologous Proteins by Site-Specific Genetic Transformation of 
Avian Cells. 

Another aspect of the present invention is a method of expressing a 
heterologous polypeptide in an avian cell by stably transfecting a cell by using site- 
25 specific integrase-mediation and a recombinant nucleic acid molecule, as described 
above, and culturing the transfected cell under conditions suitable for expression of 
the heterologous polypeptide under the control of the avian transcriptional regulatory 
region. 

The protein of the present invention may be produced in purified form by any 
30 known conventional techniques. For example, chicken cells, an egg or an egg white 
may be homogenized and centrifuged. The supernatant may then be subjected to 



-44- 



wo 2004/092351 



PCT/US2004/009253 



sequential ammonium sulfate precipitation and heat treatment. The fraction 
containing the protein of the present invention is subjected to gel filtration in an 
appropriately sized dextran or polyacrylamide column to separate the proteins. If 
necessary, the protein fraction may be further purified by HPLC or other methods well 
5 known in the art of protein purification. 

The methods of the invention are useful for expressing nucleic acid sequences 
that are optimized for expression in avian cells and which encode desired polypeptides 
or derivatives and fragments thereof. Derivatives include, for instance, polypeptides 
with conservative amino acid replacements, tiiat is, those within a family of amino 

1 0 acids that are related in their side chdns (commonly known as acidic, basic, nonpolar, 
and uncharged polar amino acids). Phenylalanine, tryptophan, and tyrosine are 
sometimes classified jointly as aromatic amino acids and other groupings £ire known 
in the art (see, for example, "Biochemistry", 2nd ed, L. Stryer, ed., W.H. Freeman & 
Co.,1981). Peptides in which more than one replacement has taken place can readily 

IS be tested for activity in the same manner as derivatives with a single replacement, 
using conventional polypeptide activity assays (e.g. for enzymatic or ligand binding 
activities). 

Regarding codon optimization, if the recombinant nucleic acid molecules are 
transfected into a recipient chicken cell, the sequence of the nucleic acid insert to be 

20 expressed can be optimized for chicken codon usage. This may be determined from 
the codon usage of at least one, and preferably more than one, protein expressed in a 
chicken cell according to well known principles. For example, in the chicken the 
codon usage could be determined from the nucleic acid sequences encoding the 
proteins such as lysozyme, ovalbumin, ovomucin and ovotransferrin of chicken. 

25 Optimization of the sequence for codon usage can elevate the level of translation in 
avian eggs. 

The present invention provides methods for the production of a protein by an 
avian cell comprising the steps of maintaining an avian cell, transfecting with a first 
expression vector and, optionally, a second expression vector, under conditions 
30 suitable for proliferation and/or gene expression and such that an integrase will 
mediate site specific recombination at att sites. The expression vectors may each have 
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a transcription unit comprising a nucleotide sequence encoding a heterologous 
polypeptide, wherein one polypeptide is an integrase, a transcription promoter, and a 
transcriptional terminator. The ceils may then be maintained under conditions for the 
expression and production of the desired heterologous polypeptide(s). 
5 The present invention further relates to methods for gene expression by avian 

cells from nucleic acid vectors, and transgenes derived therefrom, that include more 
than one polypeptide-encoding region wherein, for example, a first polypeptide- 
encoding region can be operatively linked to an avian promoter and a second 
polypeptide-encoding region is operatively linked to an Internal Ribosome Entry 

10 Sequence (IRES). It is contemplated that the first polypeptide-encoding region, the 
IRES and the second polypeptide-encoding region of a recombinant DNA of the 
present invention may be arranged linearly, with the IRES operably positioned 
immediately 5' of the second polypeptide-encoding region. This nucleic acid 
construct, when inserted into the genome of an avian cell or a bird and expressed 

15 therein, will generate individual polypeptides that may be post-translationally 
modified and combined in the white of a hard shell bird egg. Alternatively, the 
expressed polypeptides may be isolated from an avian egg and combined in vitro. 

The invention, therefore, includes methods for producing multimeric proteins 
including immunoglobulins, such as antibodies, and antigen binding fragments 

20 thereof. Thus, in- one embodiment of the present invention, the multimeric protein is 
m immunoglobulin, wherein the first and second heterologous polypeptides are 
immunoglobulin heavy and light chains respectively. Illustrative examples of this and 
other aspects of the present invention for the production of heterologous multimeric 
polypeptides in avian cells are fully disclosed in U.S. Patent Application No. 

25 09/877,374, filed June 8, 2001, by Rapp, published as US-2002-0108132-A1 on 
August 8, 2002, and U.S. Patent Application No. 10/251,364, filed September 18, 
2002, by Rapp, both of which are incorporated herein by reference in their entirety. 

Accordingly, the invention further provides immunoglobulin and other 
multimeric proteins that have been produced by transgenic avians of the invention. 

30 In various embodiments, an immunoglobulin polypeptide encoded by the 

transcriptional unit of at least one expression vector may be an immunoglobulin heavy 
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chain polypeptide comprising a variable region or a variant thereof, and may further 
comprise a D region, a J region, a C region, or a combination thereof An 
immunoglobulin polypeptide encoded by an expression vector may also be an 
immunoglobulin light chain polypeptide comprising a variable region or a variant 
5 thereof, and may fUrther comprise a J region and a C region. The present invention 
also contemplates multiple immunoglobulin regions that are derived from the same 
animal species, or a mixture of species including, but not only, human, mouse, rat, 
rabbit and chicken. In preferred embodiments, the antibodies are human or 
humanized. 

10 In other embodiments, the immunoglobulin polypeptide encoded by at least 

one expression vector comprises an immunoglobulin heavy chain variable region, an 
immunoglobulin light chain variable region, and a linker peptide thereby forming a 
single-chain antibody capable of selectively binding an antigen. 

Examples of therapeutic antibodies that may be produced in methods of the 

15 invention include but are not limited to HERCEPTIN™ (Trastuzumab) (Genentech, 
CA) which is a humanized anti-HER2 monoclonal antibody for the treatment of 
patients with metastatic breast cancer; REOPRO™ (abciximab) (Centocor) which is 
an anti-glycoprotein Ilb/IIIa receptor on the platelets for the prevention of clot 
formation; ZENAPAX™ (daclizumab) (Roche Pharmaceuticals, Switzerland) which 

20 is an immunosuppressive, humanized anti-CD25 monoclonal antibody for the 
prevention of acute renal allograft rejection; PANOREX™ which is a murine anti-17- 
lA cell surface antigen IgG2a antibody (Glaxo Wellcome/Centocor); BEC2 which is a 
murine anti-idiotype (GD3 epitope) IgG antibody (ImClone System); 1MC-C225 
which is a chimeric anti-EGFR IgG antibody (ImClone System); VITAXIN™ which 

25 is a humanized anti-aVps integrin antibody (Applied Molecular 
Evolution/Medlmmune); Campath lH/LDP-03 which is a humanized anti CD52 IgGl 
antibody (Leukosite); Smart Ml 95 which is a humanized anti-CD33 IgG antibody 
(Protein Design Lab/Kanebo); RITUXAN™ which is a chimeric anti-CD20 IgGl 
antibody (IDEC Pharm/Genentech, Roche/Zettyaku); LYMPHOCIDE™ which is a 

30 humanized anti-CD22 IgG antibody (Immunomedics); ICM3 is a humanized anti- 
1CAM3 antibody (ICOS Pharm); IDEC-114 is a primate anti-CD80 antibody (IDEC 
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Pharm/Mitsubishi); ZEVALIN™ is a radiolabelled murine anti-CD20 antibody 
(IDEC/Schering AG); IDEC-131 is a humanized anti-CD40L antibody (IDEC/Eisai); 
IDEC-1 51 is a primatized anti-CD4 antibody (IDEC); IDEC-152 is a primatized anti- 
CD23 antibody (IDEC/Seikagalcu); SMART anti-CD3 is a humanized anti-CD3 IgG 
5 (Protein Design Lab); 5G1.1 is a humanized anti-complement factor 5 (CS) antibody 
(Alexion Pharm); D2E7 is a humanized anti-TNF-a antibody (CATIBASF); CDP870 
is a humanized anti-TNF-a Fab fragment (Celltech); IDEC-lSl is a primatized anti- 
CD4 IgGl antibody (IDEC Pharm/SmithKline Beecham); MDX-CD4 is a human anti- 
CD4 IgG antibody (Medarex/Eisai/Genmab); CDP571 is a humanized anti-TNF-a 
10 IgG4 antibody (Celltech); LDP-02 is a humanized anti-a4P7 antibody 
(LeukoSite/Genentech); OrthoClone OKT4A is a humanized anti-CD4 IgG antibody 
(Ortho Biotech); ANTOVA™ is a humanized anti-CD40L IgG antibody (Biogen); 
ANTEGREN™ is a humanized anti-VLA-4 IgG antibody (Elan); and CAT-152 is a 
human anti-TGF-Pa antibody (Cambridge Ab Tech). 

15 

Production ofHeterolozous Protein by Transgenic Avians 

One aspect of the present invention, therefore, concerns transgenic birds, such 
as chickens, comprising a recombinant nucleic acid molecule and which preferably 
(though optionally) express a heterologous gene in one or more cells in the animal. 

20 Suitable methods for the generation of transgenic avians having heterologous DNA 
incorporated therein are described, for example, in WO 99/19472 to Ivarie et ah ; WO 
00/11151 to Ivarie et al. .\ and WO 00/56932 to Harvev et al .. all of which are 
incorporated herein by reference in their entirety. 

Embodiments of the methods for the production of a heterologous polypeptide 

25 by the avian tissue such as the oviduct and the production of eggs which contain 
heterologous protein involve providing a suitable vector and introducing the vector 
into embryonic blastodermal cells together with an integrase, preferably phiCSl 
integrase, so that the vector can integrate into the avian genome. A subsequent step 
involves deriving a mature transgenic avian from the transgenic blastodermal cells 

30 produced in the previous steps. Deriving a mature transgenic avian from the 
blastodermal cells optionally involves transferring the transgenic blastodermal cells to 
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an embryo and allowing that embryo to develop fully, so that the cells become 
incorporated into the bird as the embryo is allowed to develop. Another alternative is 
to transfer a transfected nucleus to an enucleated recipient cell which may then 
develop into a zygote and ultimately an adult bird. The resulting chick is then grown 
5 to maturity. 

In an alternative embodiment, the cells of a blastodermal embryo are 
transfected or transduced with the vector and integrase directly within the embryo. It 
is contemplated, for example, that the recombinant nucleic acid molecules of the 
present invention may be introduced into a blastodermal embryo by direct 

10 microinjection of the DNA into a stage X or earlier embryo that has been removed 
from the oviduct. The egg is then returned to the bird for egg white deposition, shell 
development and laying. The resulting embryo is allowed to develop and hatch, and 
the chick allowed to mature. 

In one embodiment, a transgenic bird of the present invention is produced by 

15 introducing into embryonic cells such as, for instance, isolated avian blastodermal 
cells, a nucleic acid construct comprising an attB recombination site capable of 
recombining with a pseudo-attP recombination site found within the nuclear genome 
of the organism from which the cell was derived, and a nucleic acid fragment of 
interest, in a manner such that the nucleic acid fragment of interest is stably integrated 

20 into the nuclear genome of germ line cells of a mature bird and is inherited in normal 
Mendelian fashion. It is also within the scope of the invention that the targeted cells 
for receiving the transgene have been engineered to have a heterologous attP 
recombination site integrated into the nuclear genome of the cells, thereby increasing 
the efficiency of recognition and recombination with a heterologous attB site. 

25 In either case, the transgenic bird produced from the transgenic blastodermal 

cells is known as a "founder" Some founders can be chimeric or mosaic birds if, for 
example, microinjection does not deliver nucleic acid molecules to all of the 
blastodermal cells of an embryo. Some founders will carry the fransgene in the 
tubular gland cells in the magnum of their oviducts and will express the heterologous 

30 protein encoded by the transgene in their oviducts. If the heterologous protein 
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contains the appropriate signal sequences, it will be secreted into the lumen of the 
oviduct and onto the yolk of an egg. 

Some founders are germ-line founders. A germ-line founder is a founder that 
carries the transgene in genetic material of its germ-line tissue, and may also carry the 
5 transgene in oviduct magnum tubular gland cells that express the heterologous 
protein. Therefore, in accordance with the invention, the transgenic bird will have 
tubular gland cells expressing the heterologous protein and the ofTspring of the 
transgenic bird will also have oviduct magnum tubular gland cells that express the 
selected heterologous protein. (Alternatively, the offspring express a phenotype 

1 0 determined by expression df the exogenous gene in a specific tissue of the avian.) 

The invention can be used to express, in large yields and at low cost, a wide 
range of desired proteins including those used as human and animal pharmaceuticals, 
diagnostics, and livestock feed additives. Proteins such as growth hormones, 
cytokines, structural proteins and enzymes including human growth hormone, 

15 interferon, lysozjnme, and p-casein are examples of proteins which are desirably 
expressed in the oviduct and deposited in eggs according to the invention. Other 
possible proteins to be produced include, but are not limited to, albumin, a-1 
antitrypsin, antithrombin III, collagen, factors VIII, IX, X (and the like), fibrinogen, 
hyaluronic acid, insulin, lactoferrin, protein C, erythropoietin (EPO), granulocyte 

20 colony-stimulating factor (G-CSF), granulocyte macrophage colony-stimulating factor 
(GM-CSF), tissue-type plasminogen activator (tPA), feed additive enzymes, 
somatotropin, and chjonotrypsin. Immunoglobulins (shown, for example in Example 
10 below) and genetically engineered antibodies, including immunotoxins which bind 
to surface antigens on human tumor cells and destroy them, can also be expressed for 

25 use as pharmaceuticals or di^nostics. 

In various embodiments of the transgenic bird of the present invention, the 
expression of the transgene may be restricted to specific subsets of cells, tissues or 
developmental stages utilizing, for example, /raras-acting factors acting on the 
transcriptional regulatory region operably linked to the polypeptide-encoding region of 

30 interest of the present invention and which control gene expression in the desired 
pattern. Tissue-specific regulatory sequences and conditional regulatory sequences 
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can be used to control expression of the transgene in certain spatial patterns. 
Moreover, temporal patterns of expression can be provided by, for example, 
conditional recombination systems or prokaryotic transcriptional regulatory 
sequences. 

5 The stably modified oviduct cells will express the heterologous polynucleotide 

and deposit the resulting polypeptide into the egg white of a laid egg. For this 
purpose, the expression vector will further comprise an oviduct-specific promoter 
such as ovalbumin or ovomucoid operably linked to the desired heterologous 
polynucleotide. 

10 Another aspect of the present invention provides a method for the production 

in an avian of an heterologous protein capable of forming an antibody suitable for 
selectively binding an antigen. This method comprises a step of producing a 
transgenic avian incorporating at least one transgene, the transgene encoding at least 
one heterologous polypeptide selected from an immunoglobulin heavy chain variable 

1 5 region, an immunoglobulin heavy chain comprising a variable region and a constant 
region, an immunoglobulin light chain variable region, an immunoglobulin light chain 
comprising a variable region and a constant region, and a single-chain antibody 
comprising two peptide-Unked immunoglobulin variable regions. 

In one embodiment of this method, the isolated heterologous protein is an 

20 antibody capable of selectively binding to an antigen and which may be generated by 
combining at least one immunoglobulin heavy chain variable region and at least one 
immunoglobulin light chain variable region, preferably cross-linked by at least one 
disulfide bridge. The combination of the two variable regions generates a binding site 
that binds an antigen using methods for antibody reconstitution that are well known in 

25 the art. 

The present invention also encompasses immunoglobulin heavy and light 
chains, or variants or derivatives thereof, to be expressed in separate transgenic 
avians, and thereafter isolated from separate media including serum or eggs, each 
isolate comprising one or more distinct species of immunoglobulin polypeptide. The 
30 method may further comprise the step of combining a plurality of isolated 
heterologous immunoglobulin polypeptides, thereby producing an antibody capable of 
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selectively binding to an antigen. In this embodiment, for instance, two or more 
individual transgenic avians may be generated wherein one transgenic produces serum 
or eggs having an immunoglobulin heavy chain variable region, or a polypeptide 
comprising such, expressed therein. A second transgenic animal, having a second 
5 transgene, produces serum or eggs havmg an immunoglobulin light chain variable 
region, or a polypeptide comprising such, expressed therein. The polypeptides from 
two or more transgenic animals may be isolated from their respective sera and eggs 
and combined in vitro to generate a binding site capable of binding an antigen. 

The present invention is further illustrated by the following examples, which 

10 are provided by way of illustration and should not be construed as limiting. The 
contents of all references, published patents and patents cited throughout the present 
application are hereby incorporated by reference in their entireties. 

It will be apparent to those skilled in the art that various modifications, 
combinations, additions, deletions and variations can be made in the present invention 

15 without departing from the scope or spirit of the invention. For instance, features 
illustrated or described as part of one embodiment can be used in another embodiment 
to yield a still further embodiment. It is intended that the present invention covers 
such modifications, combinations, additions, deletions and veu-iations as come within 
the scope of the appended claims and their equivalents. 

20 

Example 1: Phage phiC31 integrase functions in avian cells. 

(a) A luciferase vector bearing either an attB (SEQ ID NO: 2 shown in Fig. 10) or attP 
(SEQ ID NO: 3 shown in Fig. 1 1) site was co-transfected with an integrase expression 
vector CMV-C31int (SEQ ID NO: 1) into DF-1 cells, a chiclcen fibroblast cell line. 
25 The cells were pass^ed several times and the luciferase levels were assayed at each 
passage. 

Cells were passaged every 3-4 days and one third of the cells were harvested 
and assayed for lucifersise. The expression of luciferase was plotted as a percentage of 
the expression measured 4 days after transfection. A luciferase expression vector 
30 bearing an attP site as a control was also included. 
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As can be seen in Fig. 2, in the absence of integrase, luciferase expression 
from a vector bearing attP or attB decreased to very low levels after several days. 
However, luciferase levels were persistent when the luciferase vector bearing attB was 
co-transfected with the integrase expression vector, indicating that the luciferase 
5 vector had stably integrated into the avian genome. 

(b) A drug-resistance colony formation assay was used to quantitate integration 
efficiency. The puromycin resistance expression vector pCMV-pur was outfitted with 
an attB (SEQ ID NO: 4 shown in Fig. 12) or an attP (SEQ ID NO: 5 shown in Fig. 13) 
sites. Puromycin resistance vectors bearing attB sites were cotransfected with phiCSl 

10 integrase or a control vector into DF-1 cells. One day after transfection, puromycin 
was added. Puromycin resistant colonies were counted 12 days post-transfection. 

In the absence of co-transfected integrase expression, few DF-1 cell colonies 
were observed after survival selection. When integrase was co-expressed, multiple 
DF-1 cell colonies were observed, as shown in Fig. 3. Similar to the luciferase 

IS expression experiment, the attB sequence (but not the attP sequence) was able to 
facilitate integration of the plasmid into the genome. Fig. 3 also shows that phiC3 1 
integrase functions at both 37° Celsius and 41° Celsius. Integrase also functions in 
quail cells using the puromycin resistance assay, as shown in Fig. 4. 

(c) The CMV-pur-attB vector (SEQ ID NO: 4) was also contransfected with an 
20 enhanced green fluorescent protein (EGFP) expression vector bearing an attB site 

(SEQ ID NO: 6 shown in Fig. 14) into DF-1 cells and the phiC31 integrase expression 
vector CMV-C31int (SEQ ID NO: 1). After puromycin selection for 12 days, the 
colonies were viewed with UV light , to determine the percentage of cells that 
expressed EGFP. Approximately 20% of puromycin resistant colonies expressed 

25 EGFP in all of the cells of the colony, as shown in Fig. 5, indicating that the integrase 
can mediate multiple integrations per cell. 

(d) PhiC3 1 integrase promoted the integration of large transgenes into avian 
cells. A puromycin expression cassette comprising a CMV promoter, puromycin 
resistance gene, polyadenylation sequence and the attB sequence was inserted into a 

30 vector containing a 12.0 kb lysozyme promoter and the human interferon a2b gene 
(SEQ ID NO: 7 shown in Fig. 15) and into a vector containing a 10.0 kb ovomucoid 
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promoter and the human interferon a2b gene (SEQ ID NO: 8) as shown in Fig. 16. 

DF-1 cells were transfected with donor plasmids of varying lengths bearing a 
puromycin resistance gene and an attB sequence in the absence or presence of an 
integrase expression plasmid. Puromycin was added to the culture mediate kill those 
5 cells which did not contain a stably integrated copy of the puromycin resistance gene. 
Cells with an integrated gene formed colonies in the presence of puromycin in 7-12 
days. The colonies were visualized by stfuning with methylene blue and the entire 60 
mm culture dish was imaged. 

PhiC31 integrase mediated the efficient integration of both vectors as shown in 

10 Fig. 7. 

Example 2: Cell culture methods. 

DF-1 cells were cultured in DMEM with high glucose, 10% fetal bovine 
serum, 2 mM L-glutamine, 100 units/ml penicillin and 100 jig/ml streptomycin at 37° 
15 Celsius and 5% CO2. A separate population of DF-1 cells was grown at 41° Celsius. 
These cells were adapted to the higher temperature for one week before they were 
used for experiments. 

Quail QT6 cells were cultured in FIO medium (Gibco) with 5% newborn calf 
serum, 1% chicken serum heat inactivated (at 55" Celsius for 45 mins), 10 units/ ml 
20 penicillin and 10 fig/ml streptomycin at 37° Celsius and 5% CO2. 

Example 3; Selection and Assay Methods 
(a) Puromycin selection assay: About 0.8 x 10^ DF-1 (chicken) or QT6 (quail) cells 
were plated in 60 mm dishes. The next day, the cells were transfected as follows: 

25 10 to 50 ng of a donor plasmid and 1 to 10 ng of an Integrase-expressing 

plasmid DNA were mixed with 150 jil of OptiMEM. 15 |iil of DMRIE-C was mixed 
with 1 50 p.1 of OptiMEM in a separate tube, and the mixtures combined and incubated 
for 1 5 mins. at room temperature. 

While the liposome/DNA complexes were forming, the cells were washed 

30 with OptiMEM and 2.5 ml of OptiMEM was added. After 15 minutes, 300 ^1 of the 
DNA-lipid mixture was added drop wise to the 2.5 ml of OptiMEM covering the cell 
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layers. The cells were incubated for 4-5 hours at either 37° Celsius or 41° Celsius, 5% 
CO2. The transfection mix was replaced with 3 mis of culture media. The next day, 
puromycin was added to the media at a final concentration of 1 ug/ml, and the media 
replaced every 2 to 4 days. Puromycin resistant colonies were counted or imaged 10- 
5 1 2 days after the addition of puromycin. 

(b) Luciferase assay: Chicken DF-1 or quail QT6 cells (0.8 x 10^) were plated in 60 
mm dishes. Cells were transfected as described above. The cells from a plate were 
transferred to a new 100 mm plate when the plate became confluent, typically on day 
3-4, and re-passaged every 3-4 days. 

10 At each time point, one-third of the cells from a plate were replated, and one- 

third were harvested for the luciferase assay. The cells were pelleted in an eppendorf 
tube and frozen at -yO^C. 

The cell pellet was lysed in 200 jil of lysis buffer (25 mM Tris-acetate, pH7.8, 
2mM EDTA, 0.5% Triton X-100, 5% glycerol). Sample (5|j,l) was assayed using the 

1 5 Promega BrightGlo reagent system. 

(c) Visualization of EGFP: EGFP expression was visualized with an inverted 
microscope with FITC illumination [Olympus 1X70, 100 W mercury lamp, HQ-FITC 
Band Pass Emission filter cube, exciter 480/40 nm, emission 535/50 nm, 20X phase 
contrast objective (total magnification was 2.5 x 10 x 20)]. 

20 (d) Staining of cell colonies: After colonies had formed, typically after 7-12 days of 
culture in puromycin medium, the cells were fixed in 2% formaldehyde, 0.2% 
glutaraldehyde for 15 mins, and stained in 0.2% methylene blue for 30 mins. followed 
by several washes with water. The plates were imaged using a standard CCD camera 
in visible light. 

25 

Example 4; Production of genetically transformed avian cells. 
Avian stage X blastodermal cells are used as the cellular vector for the 
transgenes. Stage X embryos are collected and the cells dispersed and mixed with 
plasmid DNA. The transgenes are then introduced to blastodermal cells via 
30 electroporation. The cells are immediately injected back into recipient embryos. 
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The cells are not cultured for any time period to ensure that they remain 
capable of contributing to the germline of resulting chimeric embryos. However, 
because there is no culture step, cells that bear the transgene cannot be identified. 
Typically, only a small percentage of cells introduced to an embryo will bear a stably 
5 integrated transgene (0.01 to 1%). To increase the percentage of cells bearing a 
transgene, therefore, the transgene vector bears an attB site and is co-electroporated 
with a vector bearing the CMV promoter driving expression of the phiC31 transgene 
(CMV-C31int (SEQ ID NO: 1)). The integrase then drives integration of the 
transgene vector into the nuclear genome of the avian cell and increases the 
1 0 percentage of cells bearing a stable transgene. 

(a) Preparation of avian stage X blastodermal cells: 

i) Collect fertilized eggs from Barred Rock or White leghorn chickens 
(Gallus gallus) or quail (Japonica cotumix) within 48 hrs. of laying; 

ii) Use 70% ethanol to clean the shells; 
15 iii) Crack the shells and open the eggs; 

iv) Remove egg whites by transferring yolks to opposite halves of shells, 
repeating to remove most of the egg whites; 

v) Put egg yolks with embryo discs facing up into a 1 0cm petri dish; 

vi) Use an absorbent tissue to gently remove egg white from the embryo discs; 
20 vii) Place a Whatman filter paper 1 ring over the embryos; 

viii) Use scissors to cut the membranes along the outside edge of the paper ring 
while gently lifting the ring/embryos with a pair of tweezers; 

ix) Insert the paper ring with the embryos at a 45 degrees angle into a petri 
dish containing PBS-G solution at room temperature; 

25 x) After ten embryo discs are collected, gently wash the yolks from the 

blastoderm discs using a Pasteur pipette under a stereo microscope; 
xi) Cut the discs by a hair ring cutter (a short piece of human hair is bent into a 
small loop and fastened to the narrow end of a Pasteur pipette with 
Parafilm); 

30 xii) Transfer the discs to a 1 5 ml sterile centrifuge tube on ice; 
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xiii) Place 10 to 15 embryos per tube and allow to settle to the bottom (about 5 
mins.); 

xiv) Aspirate the supernatant from the tube; 

XV) Add 5 mis of ice-cold PBS without Ca^ and Mg^, and gently pipette 4 to 
5 5 times using a 5 mis pipette; 

xvi) Incubate in ice for 5-7 mins. to allow the blastoderms to settle, and aspirate 
the supernatant; 

xvii) Add 3 mis of ice cold 0.05% trypsin/0.02% ETDA to each tube and gently 
pipette 3 to 5 times using a 5 ml pipette; 

10 xviii) Put the tube in ice for 5 mins. and then flick the tube by finger 40 times. 

Repeat; 

xix) Add 0.5 mis FBS and 3-5 mis BDC medium to each tube and gently 

pipette 5-7 times using a 5 ml pipette; 
XX) Spin at 500 rpm ( RCF 57 x g ) at 4' Celsius for 5 mins; 
1 5 xxi) Remove the supernatant and add 2 mis ice cold BDC medium into each 

tube; and 

xxii) Resuspend the cells by gently pipetting 20-25 times; and 

xxiii) Determine the cell titer by hemacytometer and ensure that about 95% of all 
BDCs are single cells, and not clumped. 

20 (b) Tmnsfection of linearized plasmids into blastodermal cells by small scale 
electroporation: 

i) Centrifuge the blastodermal cell suspension from step (xxiii) above at RCF 
57 X g, 4° Celsius, for 5 mins; 

ii) Resuspend cells to a density of 1-3 x 10* per ml with PBS without Ca^* 
25 and Mg^*; 

iii) Add linearized DNA, 1-30 fig per 1-3 x 10^ blastodermal cells in an 
eppendorf tube at room temperature. Add equimolar molar amounts of the 
non-linearized transgene plasmid bearing an attB site, and an integrase 
expression plasmid; 

30 iv) Incubate at room temperature for 10 mins; 
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v) Aliquot 100 jaI of the DNA-cell mixture to a 0.1 cm cuvette at room 
temperature; 

vi) Electroporate at 240 V and 25 fiFD (or 1 00 V and 125 nFD for quail cells) 
using, for example, a Gene Pulser IF** (BIO-RAD). 

5 vii) Incubate the cuvette at room temperature for 1-10 mins. 

viii) Before the electroporated cells are injected into a recipient embryo, they 
are transferred to a eppendorf tube at room temperature. The cuvette is 
washed with 350 |xl of media, which is transferred to the eppendorf, spun 
at room temperature and re-suspended in 0.01-0.3 ml medium; 
10 ix) Inject 1-10 (il of cell suspension into the subgerminal cavity of an non- 

irradiated or, preferably, an irradiated (e.g., with 300-900 rads) stage X 
egg. Shell and shell membrane are removed and, after injection, resealed 
according to U.S. Patent No. 6,397,777 incorporated herein by reference in 
its entirety; and 
15 x) The egg is then incubated to hatching. 

(c) Blastodermal Cell Culture Medium: 

i) 409.5 mis DMEM with high glucose, L-glutamine, sodium pyruvate, 
pyridoxins hydrochloride; 

ii) 5 mis Men non-essential amino acids solution, 10 mM; 
20 iii) 5 mis Penicillin-streptomycin 5000 U/ml each; 

iv) 5 mis L-glutamine, 200 mM; 

v) 75 mis fetal bovine serum; and 

vi) 0.5 mis p-mercaptoethanol, 1 1 .2mM. 



25 Example 5; Transfection of stage X embryos with attB plasmids 

(a) DNA-PEI: Twenty-five fig of a phage phiC31 integrase expression plasmid 
(pCMV-int), and 25 jig of a luciferase-expressing plasmid (pP-actin-GFP-attB) are 
combined in 200 nl of 28 mM Hepes (pH 7.4). The DNATHepes is mixed with an 
equal volume of PEI which has been diluted 10-foId with water. The DNA/Hepes/PEI 
30 is incubated at room temperature for 15 mins Three to seven ^il of the complex are 
injected into the subgerminal cavity of windowed stage X white leghorn eggs which 
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are then sealed and incubated as described in U.S. Patents No. 6,391,111. The 
complexes will also be incubated with blastodermal cells isolated from stage X 
embryos which are subsequently injected into the subgerminal cavity of windowed 
irradiated stage X white leghorn eggs. Injected eggs are sealed and incubated as 
5 described above. 
(b) Adenovirus-PEI: 

Two jAg of a phage phiC31 integrase expression plasmid (pCMV-int), 2 (ig of a GFP 
expressing plasmid (pp-actin-GFP-attB) and 2 jug of a luciferase expressing plasmid 
(pGLB) were incubated with 1.2 |il of JetPEI™ in 50 fil of 20 mM Hepes buffer 
10 (pH7.4). After 10 mins at 25°C, 3 x 10* adenovirus particles (Ad5-Null, Qbiogene) 
were added and the incubation continued for an additional 10 mins. Embryos are 
transfected in ovo or ex ovo as described above. 

Example 6: Stage I cytoplasmic iajection 
1 5 Production of transgenic chickens by cytoplasmic DNA injection using DNA 

injection directly into the germinal disk as described in Sang et al., Mol. Reprod. Dev., 
1: 98-106 (1989); Love et al.. Biotechnology, 12: 60-63 (1994) incorporated herein by 
reference in their entireties. 

In the method of the present invention, fertilized ova, and preferably stage I 
20 embryos, are isolated from euthanized hens 45 mins. to 4 hrs. after oviposition of the 
previous egg. Alternatively, eggs were isolated from hens whose oviducts have been 
fistulated according to the techniques of Gilbert & Wood-Gush, J. Reprod. Fertil., 5: 
45 1 -453 (1 963) and Pancer et al., Br. Poult. ScL, 30: 953-7 (1989) incorporated herein 
in their entireties. 

25 An isolated ovum was placed in dish with the germinal disk upwards. 

Ringer's buffer medium was then added to prevent drying of the ovum. Any suitable 
microinjection assembly and methods for microinjecting and reimplanting avian eggs 
are useful in the method of cytoplasmic injection of the present invention. A 
particularly suitable apparatus and method for use in the present invention is described 

30 in U.S. Patent Application Serial No: 09/919,143 ("the '143 Application) and 
incorporated herein by reference in its entirety. The avian microinjection system 
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described in the '143 Application allowed the loading of a DNA solution into a 
micropipette, followed by prompt positioning of the germinal disk under the 
microscope and guided injection of the DNA solution into the germinal disk. Injected 
embryos could then be surgically transferred to a recipient hen as described, for 
5 example, in Olsen & Neher, J. Exp. Zool, 109: 355-66 (1948) and Tanaka et al, J. 
Reprod. Fertil, 100: 447-449 (1994). The embryo was allowed to proceed through 
the natural in vivo cycle of albumin deposition and hard-shell formation. The 
transgenic embryo is then laid as a hard-shell egg which was incubated until hatching 
of the chick. Preferably, injected embryos were surgically transferred to recipient 

10 hens via the ovum transfer method of Christmann et al. in PCT/USO 1/26723, the 
contents of which are incorporated by reference in its entirety, and hard shell eggs 
were incubated and hatched. 

Approximately 25 nl of DNA solution (about 60ng/nl) with either integrase 
mRNA or protein were injected into a germinal disc of stage I White Leghorn 

15 embryos obtained 90 minutes after oviposition of the preceding egg. Typically the 
concentration of integrase mRNA used was 100 ng/p.1, and the concentration of 
integrase protein was 66 ng/fxl. 

To synthesize the integrase mRNA, a plasmid template encoding the integrase 
protein was linearized at the 3' end of the transcription unit. mRNA was synthesized, 

20 capped and a polyadenine tract added using the mMESSAGE mMACHINE T7 Ultra 
KifTM (Ambion, Austin, TX). The mRNA was purified by extraction with phenol and 
chloroform and precipitiated with isopropanol. The integrase protein was expressed 
in E. coli and purified as described by Thorpe et al, Mol. Microbiol, 38: 232-241 
(2000). 

25 A plasmid encoding for the integrase protein is transfected into the target cells. 

However, since the early avian embryo transcriptionally silent until it reaches about 

22,000 cells, injection of the integrase mRNA or protein was expected to result in 

better rates of transgenesis, as shovm in the Table 2 below. 

The chicks produced by this procedure were screened for the presence of the 
30 injected transgene using a high throughput PCR-based screening procedure as 

described in Harvey et al. Nature Biotech, 20: 396-399 (2002). 
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Table 2: Summary of cytoplasmic injection results using different integrase strategies 



Experimental 
group 


Ovum 
transfers 


Hard shells 
produced (%) 


Chicks 
hatched (%) * 


Transgenic 
chicks (%) ^ 


No Integrase 


5164 


3634 (70%) 


500 (14%) 


58(11.6%) 


Integrase 
mRNA 


1109 


833 (75%) 


115(13.8%) 


19(16.5%) 


Integrase 
protein 


374 


264 (70.6%) 


47(17.8%) 


16 (34%) 


* : Percentages based on the number of hard s 


lells 



Percentages based on the number of hatched birds 



5 Example 7: Characterization of phiC31 integrase-mediated integration sites in 

the chicken genome. 
To characterize phiC31 -mediated integration into the chicken genome, a 
plasmid rescue method was used to isolate integrated plasmids from transfected and 
selected chicken fibroblasts. Plasmid pCR-XL-TOPO-CMV-pur-attB (SEQ ID NO: 
10 10, shown in Fig, 18) does not have BamH I or Bgl II restriction sites. Genomic DNA 
from cells transformed with pCR-XL-TOPO-CMV-pur-attB was cut with BamH I or 
Bgl II (either or both of which would cut in the flanking genomic regions) and 
religated so that the genomic DNA surrounding the integrated plasmid would be 
captured into the circularized plasmid. The flanking DNA of a number of plasmids 
1 5 were then sequenced. 

DF-1 cells (chicken fibroblasts), 4 x 10^ were transfected with 50 ng of pCR- 
XL-TOPO-CMV-pur-attB and 1 jig of pCMV-int. The following day, the culture 
medium was replaced with fresh media supplemented with 1 jig/ml puromycin. After 
10 days of selection, several hundred puromycin-resistant colonies were evident. 
20 These were harvested by trypsinzation, pooled, replated on 1 0 cm plates and grown to 
confluence. DNA was then extracted. 

Isolated DNA was digested with BamH I and Bgl U for 2-3 hrs, extracted with 
phenol:chloroform:isoamyl alcohol chloroform :isoamyl alcohol and ethanol 
precipitated. T4 DNA ligase was added and the reaction incubated for 1 hr at room 
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temperature, extracted with phenohchloroformrisoamyl alcohol and 
chloroform:isoamyl alcohol, and precipitated with ethanol. 5 \il of the DNA 
suspended in 1 0\i\ of water was electroporated into 25 \i\ of Genehogs^'^ (Invitrogen) 
in an 0.1 cm cuvette using a GenePulser II (Biorad) set at 1.6 kV, 100 ohms, 25 uF 

5 and plated on Luria Broth (LB) plates with 5 p.g/ml phleomycin (or 25 jig/ml zeocin) 
and 20 (ig/ml kanatnycin. Approximately ICQ individual colonies were cultured, the 
plasmids extracted by standard miniprep techniques and digested with Xba I to 
identify clones with unique restriction fiagments. 

Thirty two plasmids were sequenced with the primer attB-for (5- 

10 TACCGTCGACGATGTAGGTCACGGTC-3') (SEQ ID NO: 12) which allows 
sequencing across the crossover site of attB and into the flanking genomic sequence. 
All of plasmids sequenced had novel sequences inserted into the crossover site of 
attB, indicating that the clones were derived from plasmid that had integrated into the 
chicken genome via phiCSl integrase-mediated recombination, 

1 5 The sequences were compared with sequences at GenBank using Basic Local' 

Alignment Search Tool (BLAST). Most of the clones harbored sequences 
homologous to Gallus genomic sequences in the TRACE database. 

Example 8; Insertion of a wild-type attP site into the avian genome augments 
integrase-mediated integration and transgenesis. 

The chicken B-cell line DT40 cells (Buerstedde et al, E.M.B.O. J., 9: 921-927 
(1990)) are useful for studying DNA integration and recombination processes 
(Buerstedde & Takeda, Cell, 67:179-88 (1991)). DT40 cells were engineered to 
harbor a wild-type attP site isolated from the Streptomyces phage phiC31. Two 
independent cell lines were created by transfection of a linearized plasmid bearing an 
attP site linked to a CMV promoter driving the resistance gene to G418 (DT40-NLB- 
attP) or bearing an attP site linked to a CMV promoter driving the resistance gene for 
puromycin (DT40-pur-attP). The transfected cells were cultured in the presence of 
G418 or puromycin to enrich for cells bearing an attP sequence stably integrated into 
the genome. 



20 



25 



30 
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A super-coiled luciferase vector bearing an attB (SEQ ID NO: 2 shown in Fig. 
10) was co-transfected, together with an integrase expression vector CMV-C31int 
(SEQ ID NO: 1) or a control, non-integrase expressing vector (CMV-BL) into wild- 
type DT40 cells and the stably transformed lines DT40-NLB-attP and DT40-pur-attP. 
5 Cells were passaged at 5, 7 and 14 days post-transfection and about one third 

of the cells were harvested and assayed for luciferase. The expression of luciferase 
was plotted as a percentage of the expression measured 5 days after transfection. As 
can be seen in Fig. 21, in the absence of integrase, or in the presence of integrase but 
in the DT40 cells lacking an inserted wild-type attP site, luciferase expression from a 

10 vector bearing attB progressively decreased to very low levels. However, luciferase 
levels were persistent when the luciferase vector bearing attB was co-transfected with 
the integrase expression vector into the attP bearing cell lines DT40-NLB-attP and 
DT40-pur-attP. Inclusion of an attP sequence in the avian genome augments the level 
of integration efficiency beyond that afforded by the utilization of endogenous 

1 5 pseudo-attP sites. 

Example 9: Generation of attP transgenic cell line 
and birds using an NLB vector 
The NLB-attP retroviral vector can be injected into stage X chicken embryos 
20 laid by pathogen-free hens. A small hole is drilled into the egg shell of a freshly laid 
egg, the shell membrane cut away and the embryo visualized by eye. With a drawn 
needle attached to a syringe, 1 to 10 |J,1 of concentrated retrovirus, approximately 2.5 x 
10^ lU, is injected into the subgerminal cavity of the embryo. The egg shell is resealed 
with a hot glue gun. Suitable methods for the manipulation of avian eggs, including 
25 opening and resealing hard shell eggs are described in U.S. Patent Serial Nos: 
5,897,998 and 6,397,777 which are herein incorporated by reference in their entireties. 

Typically, 25% of embryos hatch 21 days later. The chicks are raised to sexual 
maturity and semen samples are taken. Birds that have a significant level of the 
transgene in sperm DNA will be identified, typically by a PCR-based assay. Ten to 
30 25% of the hatched roosters will be able to give rise to Gl transgenic offspring, 1 to 
20% of which may be transgenic. DNA extracted fi-om the blood of Gl offspring is 
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analyzed by PGR and Southern analysis to confirm the presence of the intact 
transgene. Several lines of transgenic roosters, each with a unique site of attP 
integration, are then bred to non-transgenic hens, giving 50% of G2 transgenic 
offspring. Transgenic G2 hens and roosters from the same line can be bred to produce 
5 G3 offspring homozygous for the transgene. Homozygous offspring will be 
distinguished from hemizygous offspring by quantitative PGR. The same procedure 
can be used to integrate an attB or attP site into transgenic birds. 

Example 10; Expression of immunoglobulin chain polypeptides by transgenic 

chickens 

Bacterial artificial chromosomes (BACs) containing a 70 kbp segment of the 
chicken ovomucoid gene with the light and heavy chain cDNAs for a human 
monoclonal antibody inserted along with an internal ribosome entry site into the 3' 
untranslated region of the ovomucoid gene were equipped with the attB sequence. 
The heavy and light chain cDNAs were inserted into separate ovomucoid BACs such 
that expression of an intact monoclonal antibody requires the presence of both BACs 
in the nucleus. 

Several hens produced by coinjection of the attB-bearing ovomucoid BACs 
and integrase-encoding mRNA into stage I embryos produced intact monoclonal 
antibodies in their egg white.. One hen, which had a high level of the light chain 
ovomucoid BAG in her blood DNA as determined by quantitative PGR particularly 
expressed the light chain portion of the monoclonal antibody in the egg white at a 
concentration of 350 nanograms per ml, or approximately 12 jig per egg. 

25 Example 11: Stage I cytoplasmic injection with integrase activity and PEI 

Production of transgenic chickens by cytoplasmic DNA injection directly into 
the germinal disk was done as described in Example 6. 

Approximately 25 nl of aqueous DNA (about 60ng/|xl) which includes a 
transgene is placed in solution with integrase mRNA or integrase protein was mixed 
30 with an equal volume of PEI that had been diluted ten fold. The mixture was injected 
into a germinal disc of stage I White Leghorn embryos obtained about 90 minutes 
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after oviposition of the preceding egg. Typically the concentration of integrase mRNA 
used was about 100 ng/|xl, and the concentration of integrase protein was about 66 
ng/(Al. The integrase mRNA was synthesized according to Example 6. 

Transgenic chicks produced by this procedure using: integrase mRNA/PEI and 
5 integrase protein/PEI showed positive results for the presence of heterologously 
expressed protein in the blood, semen and egg white. 

Example 12: Stage I cytoplasmic injection with integrase activity and NLS 
Production of transgenic chickens by cytoplasmic DNA injection directly into 
1 0 the germinal disk was done as described in Example 6. 

DNA which includes a transgene was suspended in 0.25 M KCl and SV40 T 
antigen nuclear localization signal peptide (NLS peptide, amino acid sequence 
CGGPKKKRKVG (SEQ ID NO: 13)) was added to achieve a peptide DNA molar 
ratio of 100:1 . The DNA (about 60ng/nl) was allowed to associate with the SV40 T 
1 5 antigen NLS peptide by incubating at 25 degrees C for about 1 5 minutes. 

Integrase mRNA or integrase protein was added to approximately 25 nl of an 
aqueous DNA/NLS solution, typically, to produce a final concentration of integrase 
mRNA of about 50 ng/jxl, or an integrase protein concentration of about 33 ng/jil. The 
mixture was injected into a germinal disc of stage I White Leghorn embryos obtained 
20 about 90 minutes after oviposition of the preceding egg. The integrase mRNA was 
synthesized as according to Example 6. 

Transgenic chicks produced by this procedure using: integrase mRNA/NLS 
and integrase protein/NLS showed positive results for the presence of heterologously 
expressed protein in blood, semen and egg white. 

25 

Example 13; Dispersing of plasmid DNA in avian stage I embryos 
DNA samples are Cy3 labeled with a Cy3 ULS labeling kit (Amersham 
Pharmacia Biotech). Briefly, plasmid DNA (1 jig) is first sheared to approximately 
100 to 500 bp fragments by sonication. Resulting DNA is incubated at 65°C for 15 
30 min in Cy3 ULS labeling solution and unincorporated Cy3 dye is removed by spin 
column chromatography (CentriSep, Princeton Separations). The distribution of the 
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DNA in stage I avian embryos was visualized after introduction into the stage I avian 
embryo. Enough high molecular weight or low molecular weight PEI was added to 
the DMA to coat the DNA. Typically, PEI was added to the DNA to a concentration 
of about 5%. 

5 Figure 22 shows an avian stage one embryo containing Cy3 labeled naked 

DNA, In Figure 22 it can be seen that the DNA is localized to certain areas of the 
embryo. Figure 23 and Figure 24 show an avian stage one embryo containing Cy3 
labeled DNA coated with low molecular (22 kD) weight PEI (Figure 23) and high 
molecular weight (25 kD) PEI (Figure 24). In Figures 23 and 24, it can be seen that 

1 0 the DNA is dispersed throughout the embryos. 

These experiments show that DNA/PEI conjugates are distributed more 
uniformly in the cytoplasm of injected embryos when compared with naked DNA 

While this invention has been described with respect to various specific 
15 examples and embodiments, it is to be understood that the invention is not limited 
thereto and that it can be variously practiced with the scope of the following claims. 
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What is Claimed Is : 

1 . A method of producing a transgenic avian comprising: 

introducing into an avian cell a nucleic acid comprising a transgene, an 
integrase activity and a cationic polymer; 
5 introducing the avian cell into a recipient avian wherein the recipient 

avian produces an offspring which includes the transgene, 

thereby producing a transgenic avian. 

2. The method of claim 1 wherein introducing the nucleic acid is done by 
10 a method selected from the group consisting of microinjecting, transfection, 

electroporation and lipofection. 

3. The method of claim 1 wherein introducing the nucleic acid is done by 
microinjecting. 

1 15 

4. The method of claim 1 wherein an integrase protein is introduced into 
the cell. 

5. The method of claim 1 wherein a nucleic acid encoding an integrase is 
20 introduced into the cell. 

6. The method of claim 5 wherein the nucleic acid encoding integrase is 
mRNA. 

25 7. The method of claim 1 wherein a nuclear localization signal is 

introduced into the cell. 

8. The method of claim 7 wherein the nuclear localization signal is 
associated with the nucleic acid comprising a transgene. 

30 
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9. 



The method of claim 7 wherein the nuclear localization signal is 



associated with the nucleic acid comprising a transgene by a chemical bond, 

10. The method of claim 7 wherein the localization signal is associated 
5 with the nucleic acid comprising a transgene by an ionic bond. 

11. The method of claim 1 wherein the transgene comprises a coding 
sequence which is expressed in a cell of the transgenic avian producing a polypeptide. 

10 12. The method of claim 1 1 wherein the coding sequence is expressed in 

the blood of the transgenic avian. 

13. The method of claim 11 wherein the coding sequence is expressed in 
the sperm of the transgenic avian. 



14. The method of claim 11 wherein the polypeptide is present in egg 
white produce by the transgenic avian. 

15. The method of claim 11 wherein the coding sequence is for a light 
20 chain or a heavy chain of an antibody. 



15 



16. 



The method of claim 1 5 wherein the antibody is a human antibody. 



17. 



The method of claim 1 1 wherein the coding sequence is for a cytokine. 



25 



18. 



The method of claim 17 wherein the cytokine is interferon. 



19. 



The method of claim 1 wherein the avian cell is an avian embryo cell. 



30 



20. The method of claim 1 wherein the avian cell is a cell of an early stage 
avian embryo comprising a germinal disc. 
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21. The method of claim 1 wherein the avian cell is an avian embryo cell 
selected from the group consisting of stage I avian embryo, stage II avian embryo, 
stage III avian embryo, stage IV avian embryo, stage V avian embryo, stage VI avian 
5 embryo, stage VII avian embryo, stage VIII avian embryo, stage IX avian embryo, 
stage X avian embryo, stage XI avian embryo and stage XII avian embryo. 

21. The method of claim 1 wherein the avian cell is a cell of a stage X 
avian embryo. 

10 

22. The method of claim 1 wherein the cationic polymer comprises one or 
more compounds selected firom the group consisting of polyethylenimine, polylysine, 
DEAE-dextran, starburst dendrimers and starburst polyamidoamine dendrimers. 

15 23. The method of claim 1 wherein the cationic polymer comprises 

polyethylenimine. 

24. The method of claim 1 wherein the avian is a chicken. 

20 25. The transgenic avieui produced according to claim 1. 

26. An egg produced by a transgenic avian of claim 1 . 

27. The method of claim 1 wherein the method has an increased efficiency 
25 of transgenic avian production relative to an identical method without the integrase or 

cationic polymer. 

28. A method of producing a tratisgenic avian comprising: 
introducing into an avian cell a nucleic acid comprising a transgene, an 

30 integrase activity and a nuclear localization signal; 
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introducing tlie avian cell into a recipient avian wherein the recipient 
avian produces an offspring which includes the transgene, 
thereby producing a transgenic avian. 

5 29. The method of claim 28 wherein introducing the nucleic acid is done 

by a method selected from the group consisting of microinjecting, transfection, 
electroporation and lipofection. 

30. The method of claim 28 wherein introducing the nucleic acid is done 
10 by microinjecting. 

3 1 . The method of claim 28 wherein an integrase protein is introduced into 
the cell. 

15 32. The method of claim 28 wherein a nucleic acid encoding an integrase 

is introduced into the cell. 

33. The method of claim 32 wherein the nucleic acid encoding integrase is 
mRNA. 

20 

34. The method of claim 28 wherein a nuclear localization signal is 
introduced into the cell. 

35. The method of claim 34 wherein the nuclear localization signal is 
25 associated with the nucleic acid comprising a transgene. 

36. The method of claim 34 wherein the nuclear localization signal is 
associated with the nucleic acid comprising a transgene by a chemical bond. 

30 37. The method of claim 34 wherein the localization signal is associated 

with the nucleic acid by an ionic bond. 
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38. The method of claim 28 wherein the transgene comprises a coding 
sequence which is expressed in a cell of tiie transgenic avian producing a polypeptide. 

5 39. The method of claim 38 wherein the coding sequence is expressed in 

the blood of the transgenic avian. 

40. The method of claim 38 wherein the coding sequence is expressed in 
the sperm of the transgenic avian. 

10 

41. The method of claim 38 wherein the polypeptide is present in egg 
white produce by the transgenic avian. 

42. The method of claim 38 wherein the coding sequence is for a light 
1 5 chain or a heavy chain of an antibody. 



43. The method of claim 42 wherein the antibody is a human antibody. 

44. The method of claim 38 wherein the coding sequence is for a cytokine. 

45. The method of claim 44 wherein the cytokine is interferon. 

46. The method of claim 28 wherein the cell is an avian embryo cell. 



20 



25 47. The method of claim 28 wherein the avian cell is a cell of an early 

stage avian embryo comprising a germinal disc. 

48. The method of claim I wherein the avian cell is an avian embryo cell 
selected from the group consisting of stage I avian embryo, stage II avian embryo, 
30 stage III avian embryo, stage IV avian embryo, stage V avian embryo, stage VI avian 



-71 - 



wo 2004/092351 



PCT/US2004/009253 



embryo, stage VII avian embryo, stage VUI avian embryo, stage IX avian embryo, 
stage X avian embryo, stage XI avian embryo and stage XII avian embryo. 

49. The method of claim 28 wherein the avian cell is a cell of a stage X 
5 avian embryo. 

50. The method of claim 28 wherein the cationic polymer comprises one or 
more compounds selected from the group consisting of polyethylenimine, polylysine, 
DEAE-dextran, starburst dendrimers and starburst polyamidoamine dendrimers. 

10 

51. The method of claim 28 wherein the cationic polymer comprises 
polyethylenimine. 

52. The method of claim 28 wherein the avian is a chicken. 

15 

53. The transgenic avian produced according to claim 28. 

54. An egg produced by a transgenic avian of claim 28. 

20 55. The method of claim 28 wherein the method has an increased 

efficiency of transgenic avian production relative to an identical method without the 
integrase or nuclear localization signal. 

56. A method of dispersing nucleic acid in a cell comprising: 
introducing into a cell a nucleic acid and a dispersing agent in an 

25 amount that will disperse the nucleic acid in a cell 
thereby dispersing nucleic acid in a cell. 

57. The method of claim 56 wherein the cell is an avian cell. 
30 58. The method of claim 56 wherein the cell is an embryo cell 
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59. The method of claim 56 wherein the nucleic acid includes a transgene. 

60. The method of claim 56 wherein NLS or integrase activity is 
5 introduced into the cell. 

61. The method of claim 57 including introducing the avian cell into a 
recipient avian wherein the recipient avian produces an offspring which includes the 
transgene, 

10 

62. The method of claim 56 wherein the dispersing is a homogeneous 
dispersing. 

63. The method of claim 56 wherein the dispersing agent is a cationic 
1 5 polymer. 

64. The method of claim 56 wherein the cationic polymer comprises one or 
more compounds selected from the group consisting of polyethylenimine, polylysine, 
DEAE-dextran, starburst dendrimers and starburst polyamidoamine dendrimers. 

65. The method of claim 56 wherein the dispersing agent is 
20 polyethylenimine. 



25 
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pCMV-C31int (SEQ ID NO: 1) 

CATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCTATT 

ACGCCAGCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGGATCG 

ATCCAGACATGATAAGATACATTGATGAGTTTGGACAAACCACAACTAGAATGCAGTGAAAA 

AAATGCTTTATTTGTGAAATTTGTGATGCTATTGCTTTATTTGTAACCATTATAAGCTGCAA 

TAAACAAGTTAACAACAACAATTGCATTCATTTTATGTTTCAGGTTCAGGGGGAGGTGTGGG 

AGGTTTTTTAAAGCAAGTAAAACCTCTACAAATGTGGTATGGCTGATTATGATCATGAACAG 

ACTGTGAGGACTGAGGGGCCTGAAATGAGCCTTGGGACTGTGAATCTAAAATACACAAACAA 

TTAGAATCACTAGCTCCTGTGTATAATATTTTCATAAATCATACTCAGTAAGCAAAAGTCTC 

AAGCAGCAAGCATATGCAGCTAGTTTAACACATTATACACTTAAAAATTTTATATTTACCTT 

AGAGCTTTAAATCTCTGTAGGTAGTTTGTCCAATTATGTCACACCACAGAAGTAAGGTTCCT 

TCACAAAGATCCCAAGCTAGCTTATAATACGACTCACTATAGGGAGAGAGCTATGACGTCGC 

ATGCACGCGTAAGCTTGGGCCCCTCGAGGGATCCGGGTGTCTCGCTACGCCGGTACGTCTTC 

CGTGCCGTCCTGGGCGTCGTCTTCGTCGTCGTCGGTCGGCGGCTTCGCCCACGTGATCGAAG 

CGCGCTTCTCGATGGGCGTTCCCTGCCCCCTGCCCGTAGTCGACTTCGTGACAACGATCTTG 

TCTACGAAGAGCCCGACGAACACGCGCTTGTCGTCTACTGACGCGCGCCCCCACCACGACTT 

AGGGCCGGTCGGGTCAGCGTCGGCGTCTTCGGGGAACCATTGGTCAAGGGGAAGCTTCGGGG 

CTTCGGCGGCTTCAAGTTCGGCAAGCCGCTCTTCCGCCCCTTGCTGCCGGAGCGTCAGCGCT 

GCCTGTTGCTTCCGGAAGTGCTTCCTGCCAACGGGTCCGTCGTACGCGCCTGCCGCGCGGTC 

TTCGTACAGCTCTTCAAGGGCGTTCAGGGCGTCGGCGCGCTCCGCAACAAGGTTCGCCCGTT 

CGCCGCTCTTCTCAGGCGCCTCAGTGAGCTTGCCGAAGCGTCGGGCGGCTTCCCACAGAAGC 

GCCAACGTCTCTTCGTCGCCTTCGGCGTGCCTGATCTTGTTGAAGATGCGTTCCGCAACGAA 

CTTGTCGAGTGCCGCCATGCTGACGTTGCACGTGCCTTCGTGCTGCCCAGGTGCGGACGGGT 

CGACCACCTTCCGGCGACGGCAGCGGTAAGAGTCCTTGATCGATTCTTCCCCGCGCTTCGAA 

GTCATGACGGCGCCACACTCGCAGTACAGCTTGTCCATGGCGGACAGAATGGCTTGCCCCCG 

GGAAAGCCCCTTGCCGCGCCCCCTGCCGTCCAACCACGCCTGAAGCTCATACGACTCAGCGG 

GCTCGATGATCGGTCCGCAATCAAGCTCGACCGGCCGGAGCGTGATCGGGTCGCGCTGAATG 

CGGTAACCCTCAATCTTCGTGGTCGGCGTGCCGTCCGGCTTCTTCTTGTAGATCACCTCAGC 

GGCGAAGCCCGCAATACGCGGGTCCCGAAGGATTCGCATAACGGTTGCCGGGTCCCAGGCGC 

TTGAAGCGGTCTTCTTCCCAATCGTCTCGCCCCGGGTCGGCACGGCGTCAGCGTCCATGCGC 

TTACAAAGCCCCGTGATGCTGCCCGGGTGAATGGCGGCTTGACTGCCCGGCTTGAAGGGAAG 

GTGTTTGTGCGTCTTGATCTCACGCCACCACCACCGGATTACGTCGGGCTCGAACTCGAAGG 

GTCCGGTAAGGGGAGTGGTCGAGTGCGCAAGCTTGTTGATGACGACATTGACCATTCGGCCG 

TTGCGCGTGATCTCCTTCGTCTCCGAAACAAGCTCGAAGCCGTAAGGCGCCTTCCCGCCGAC 

GTACCCGCCCAATTCGCGCTGAAGGTTCTTCGTGTCGAGAATCTTCGCCGACTTCAGCGAAG 

ATTCTTTGTGCGACGCGTCGAGCCGCATAATCAGGTGAATCAGGTCCATGACGTTTCCCTGC 

CGGAAGACGCCTTCCTGAGTGGAAACAATCGTCACGCCCAGGGCGAGCAATTCCGAGACAAT 

CGGAATCGCGTCCATGACCTTCAGGCGCGAGAAGCGCGACACGTCATAGACAATGATCATGT 

TGAGCCGCCCGGCGCGGCATTCGTTCAGGATGCGTTCGAACTCCGGGCGCTCCGCCGTCCCG 

AACGCCGACGTGCCGGGCGCTTCGCTGAAATGCCCGACGAACCTGAACCGGCCCCCGTCGCG 

CTCGACTTCGCGCTGAAGGTCGGCCGCCTTGTCTTCGTTGGCGCTACGCTGTGTCGCTGGGC 

TTGCTGCGCTCGAATTCTCGCGCTCGCGCGACTGACGGTCGTAAGCACCCGCGTACGTGTCC 

ACCCCGGTCACAACCCCTTGTGTCATGTCGGCGACCCTACGACTAGTGAGCTCGTCGACCCG 

GGAATTCCGGACCGGTACCTGCAGGCGTACCTTCTATAGTGTCACCTAAATAGCTTTTTGCA 

AAAGCCTAGGCTAGAGTCCGGAGGCTGGATCGGTCCCGGTGTCTTCTATGGAGGTCAAAACA 

GCGTGGATGGCGTGTCCAGGCGATCTGACGGTTCACTAAACGAGCTCTGCTTATATAGACCT 

CCCACCGTACACGCCTACCGCCCATTTGCGTCAATGGGGCGGAGTTGTTACGACATTTTGGA 

AAGTCCCGTTGATTTTGGTGCCAAAACAAACTCCCATTGACGTCAATGGGGTGGAGACTTGG 

AAATCCCCGTGAGTCAAACCGCTATCCACGCCCATTGATGTACTGCCAAAACCGCATCACCA 

TGGTAATAGCGATGACTAATACGTAGATGTACTGCCAAGTAGGAAAGTCCCATAAGGTCATG 

TACTGGGCATAATGCCAGGCGGGCCATTTACCGTCATTGACGTCAATAGGGGGCGTACTTGG 

CATATGATACACTTGATGTACTGCCAAGTGGGCAGTTTACCGTAAATACTCCACCCATTGAC 

GTCAATGGAAAGTCCCTATTGGCGTTACTATGGGAACATACGTCATTATTGACGTCAATGGG 

CGGGGGTCGTTGGGCGGTCAGCCAGGCGGGCCATTTACCGTAAGTTATGTAACGACCTGCAC 
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GATGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACATTATACGAGCCGGAA 

GCTATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAAAGGGCCTCGTATACGCCTATTTTT 

ATAGGTTAATGTCATGATAATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATG 

TGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGA 

CAATAACCCTGATAAATGCTTCAATAATATTGAAAAACGCGCGAATTGCAAGCTCTGCATTA 

ATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGC 

TCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCG 

GTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCA 

GCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCC 

CTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAA 

AGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCT 

TACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCT 

GTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCC 

GTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACA 

CGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCG 

GTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGT 

ATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAA 

ACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAA 

AAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAAC 

TCACGTTAAGGGATTTTGGTCATGCCATAACTTCGTATAGCATACATTATACGAAGTTATGG 

CATGAGATTATCAA2^AAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAAT 

CAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCA 

CCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGAT 

AACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCAC 

GCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGT 

GGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTJ^G 

TAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTGTCAC 

GCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATQA 

TCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAA 

GTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGC 

CATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGT 

ATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGCAG 

AACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTAC 

CGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTT 

ACTTTCACCy^GCGTTTCTGGGTGAQCa^Al^AACAGGAAGGCAAAATGCCGCAAAAAAGGGAA 

AAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTT 

ATCAGGGTTATTGTCTCATGCCAGGGGTGGGCACACATATTTGATACCAGCGATCCCTACAC 

AGCACATAATTCAATGCGACTTCCCTCTATCGCACATCTTAGACCTTTATTCTCCCTCCAGC 

ACACATCGAAGCTGCCGAGCAAGCCGTTCTCACCAGTCCAAGACCTGGCATGAGCGGATACA 

TATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTG 

CCACCTGAAATTGTAAACGTTAATATTTTGTTAAAATTCGCGTTAAATTTTTGTTAAATCAG 

CTCATTTTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTATAAATCAAAAGAATAGACCG 

AGATAGGGTTGAGTGTTGTTCCAGTTTGGAACAAGAGTCCACTATTAAAGAACGTGGACTCC 

AACGTCAAAGGGCGAAAAACCGTCTATCAGGGCGATGGCCCACTACGTGAACCATCACCCTA 

ATCAAGTTTTTTGGGGTCGAGGTGCCGTAAAGCACTAAATCGGAACCCTAAAGGGAGCCCCC 

GATTTAGAGCTTGACGGGGAAAGCCGGCGAACGTGGCGAGAAAGGAAGGGAAGAAAGCGAAA 

GGAGCGGGCGCTAGGGCGCTGGCAAGTGTAGCGGTCACGCTGCGCGTAACCACCACACCCGC 

CGCGCTTAATGCGCCGCTACAGGGCGCGTC 



Fig. 9 
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pCMV-iuc-attB (SEQ ID NO: 2) 

CTCTATCGATAGGTACCGAGCTCTTACGCGTGCTAGCCCTCGAGCAGGATCTATACATTGAA 

TCAATATTGGCAATTAGCCATATTAGTCATTGGTTATATAGCATAAATCAATATTGGCTATT 

GGCCATTGCATACGTTGTATCTATATCATAATATGTACATTTATATTGGCTCATGTCCAATA 

TGACCGCCATGTTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATT 

AGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCT 

GACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCA 

ATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGT 

ACATCAAGT6TATCATATGCCAAGTCCGCCCCCTATTGACGTCAATGACGGTAAATGGCCCG 

CCTGGCATTATGCCCAGTACATGACCTTACGGGACTTTCCTACTTGGCAGTACATCTACGTA 

TTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCG 

GTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGC 

ACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGC 

GGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCGTTTAGTGAACCGTCAGATCGC 

CTGGAGACGCCATCCACGCTGTTTTGACCTCCATAGAAGACAGCGGGACCGATCCAGCCTCC 

CCTCGAAGCTCGACTCTAGGGGCTCGAGATCTGCGATCTAAGTAAGCTTGGCATTCCGGTAC 

TGTTGGTAAAGCCACCATGGAAGACGCCAAAAACATAAAGAAAGGCCCGGCGCCATTCTATC 

CGCTGGAAGATGGAACCGCTGGAGAGCAACTGCATAAGGCTATGAAGAGATACGCCCTGGTT 

CCTGGAACAATTGCTTTTACAGATGCACATATCGAGGTGGACATCACTTACGCTGAGTACTT 

CGAAATGTCCGTTCGGTTGGCAGAAGCTATGAAACGATATGGGCTGAATACAAATCACAGAA 

TCGTCGTATGCAGTGAAAACTCTCTTCAATTCTTTATGCCGGTGTTGGGCGCGTTATTTATC 

GGAGTTGCAGTTGCGCCCGCGAACGACATTTATAATGAACGTGAATTGCTCAACAGTATGGG 

CATTTCGCAGCCTACCGTGGTGTTCGTTTCCAAAAAGGGGTTGCAAAAAATTTTGAACGTGC 

AAAAAAAGCTCCC^^TCATCCAAAAAATTATTATCATGGATTCTAAAACGGATTACCAGGGA 

TTTCAGTCGATGTACACGTTCGTCACATCTCATCTACCTCCCGGTTTTAATGAATACGATTT 

TGTGCCAGAGTCCTTCGATAGGGACAAGACAATTGCACTGATCATGAACTCCTCTGGATCTA 

CTGGTCTGCCTAAAGGTGTCGCTCTGCCTCATAGAACTGCCTGCGTGAGATTCTCGCATGCC 

AGAGATCCTATTTTTGGCAATCAAATCATTCCGGATACTGCGATTTTAAGTGTTGTTCCATT 

CCATCACGGTTTTGGAATGTTTACTACACTCGGATATTTGATATGTGGATTTCGAGTCGTCT 

TAATGTATAGATTTGAAGAAGAGCTGTTTCTGAGGAGCCTTCAGGATTACAAGATTCAAAGT 

GCGCTGCTGGTGCCAACCCTATTCTCCTTCTTCGCCAAAAGCACTCTGATTGACAAATACGA 

TTTATCTAATTTACACGAAATT6CTTCTGGTGGCGGTCCCCTCTCTAAGGAAGTCGGGGAAG 

CGGTTGCCAAGAGGTTCCATCTGCCAGGTATCAGGCAAGGATATGGGCTCACTGAGACTACA 

TCAGCTATTCTGATTACACCCGAGGGGGATGATAJiACCGGGCGCGGTCGGTAAAGTTGTTCC 

ATTTTTTGAA6CGAAGGTTGTGGATCTGGATACCGGGAAAACGCTGGGCGTTAATCAAAGAG 

GCGAACTGTGTGTGAGAGGTCCTATGATTATGTCCGGTTATGTAAACAATCCGGAAGCGACC 

AACGCCTTGATTGACAAGGATGGATGGCTACATTCTGGAGACATAGCTTACTGGGACGAAGA 

CGAACACTTCTTCATCGTTGACCGCCTGAAGTCTCTGATTAAGTACAAAGGCTATCAGGTGG 

CTCCCGCTGAATTGGAATCCATCTTGCTCCAACACCCCAACATCTTCGACGCAGGTGTCGCA 

GGTCTTCCCGACGATGACGCCGGTGAACTTCCCGCCGCCGTTGTTGTTTTGGAGCACGGAAA 

GACGATGACGGAAAAAGAGATCGTGGATTACGTCGCCAGTCAAGTAACAACCGCGAAAAAGT 

TGCGCGGAGGAGTTGTGTTTGTGGACGAAGTACCGAAAG6TCTTACCGGAAAACTCGACGCA 

AGAAAAATCAGAGAGATCCTCATAAAQGCCAAGAAGGGCGGAAAGATCGCCGTGTAATTCTA 

GAGTCGGGGCGGCCGGCCGCTTCGAGCAGACATGATAAGATACATTGATGAGTTTGGACAAA 

CCACAACTAGAATGCAGTGAAAAAAATGCTTTATTTGTGAAATTTGTGATGCTATTGCTTTA 

TTTGTAACCATTATAAQCTGCAATAAACAAGTTAACAACAACAATTGCATTCATTTTATGTT 

TCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTAAAGCAAGTAAAACCTCTACAAATGTGGTA 

AAATCGATAAGGATCAATTCGGCTTCAGGTACCGTCGACGATGTAGGTCACGGTCTCGAAGC 

CGCGGTGCGGGTGCCAGGGCGTGCCCTTGGGCTCCCCGGGCGCGTACTCCACCTCACCCATC 

TGGTCCATCATGATGAACGGGTCGAGGTGGCGGTAGTTGATGCCGGCGAACGCGCGGCGCAC 

CGGGAAGCCCTCGCCCTCGAAACCGCTGGGCGCGGTGGTCACGGTGAGCACGGGACGTGCGA 

CGGCGTCGGCGGGTGCGGATACGCGGGGCAGCGTCAGCGGGTTCTCGACGGTCACGGCGGGC 

ATGTCGACAGCCGAATTGATCCGTCGACCGATGCCCTTGAGAGCCTTCAACCCAGTCAGCTC 

CTTCCGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGACTGTCTTCTTTATCATGC 
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AACTCGTAGGACAGGTGCCGGCAGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTC 

GGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTC^AAGGCGGTAATACGGTTATCCACAG 

AATCAGGGGATAACGGAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGT 

AAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAA 

TCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCC 

CTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCC 

TTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGTAGGTATCTCAGTTCGGT 

GTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCG 

CCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCA 

GCAGCCACTGGTAAC7V.GGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAA 

GTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGC 

CAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGC 

GGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCC 

TTTGATCTTTTCTAC6GGGTCTGACGCTCAGTGGAACGAAAACTCAGGTTAAGGGATTTTGG 

TCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAA 

TCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGC 

ACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGA 

TAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCA 

CGCTCACCGGCTCCAGATTTATCAQGIAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAG 

TOGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAA 

GTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCQTGGTGTCA 

CGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATG 

ATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTA 

AGTTGGCCGCAGTGTTATavCTOVTGGTTATGGOilGCACTGCATAATTCTCTTACTGTCATG 

CCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAA6TCATTCTGAGAATAGTG 

TATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGCA 

GAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTA 

CCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTT 

TACTTTCACCAGCGTTTCTGGGTGAGCaUUlAACAGGAAGGCAAAATGCCGCAAAAAAGGGAA 

TAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATT 

TATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAAT 

AGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGCGCCCTGTAGCGGCGCATTAA 

GCGCGGCGGGTGTGGTGGTTACGCGCAGCGT6ACCGCTACACTTGCCAGCGCCCTAGCGCCC 

GCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCT 

AAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAAC 

TTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTG 

ACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACCC 

TATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAA 

AT6AGCT6ATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACQTTTACAATTTCC 

CATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCTATT 

ACGCCAGCCCAAGCTACCATGATAAGTAAGTAATATTAAGGTACGGGAGGTACTTGGAGCGG 

CCGCAATAAAATATCTTTATTTTCATTACATCTGTGTGTTGGTTTTTTGTGTGAATCGATAG 

TACTAACa^TACGCTCTCCATCAAAACAAAACGAAACys^AAACAAACTAGCAAAATAGGCTGTC 

CCCAGTGCAAGTGCAGGTGCCAGAACATTT 
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pCMV-luc-attP (SEQ ID NO: 3) 

CTCTATCGATAGGTACCGAGCTCTTACGCGTGCTAGCCCTCGAGCAGGATCTATACATTGAA 

TCAATATTGGCAATTAGCCATATTAGTCATTGGTTATATAGCATAAATCAATATTGGCTATT 

GGCCATTGCATACGTTGTATCTATATCATAATATGTACATTTATATTGGCTCATGTCCAATA 

TGACCGCCATGTTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATT 

AGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCT 

GACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCA 

ATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGT 

ACATCAAGTGTATCATATGCCAAGTCCGCCCCCTATTGACGTCAATGACGGTAAATGGCGCG 

CCTGGCATTATGCCCAGTACATGACCTTACGGGACTTTCCTACTTGGCAGTACATCTACGTA 

TTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCG 

GTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGC 

ACCAAAATCAACGGGACTTTCCAAAATGTCGTAAO^CTCCGCCCCATTGACGCAAATGGGC 

GGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCGTTTAGTGAACCGTCAGATCGC 

CTGGAGACGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCC 

CCTCGAAGCTCGACTCTAGGGGCTCGAGATCTGCGATCTAAGTAAGCTTGGCATTCCGGTAC 

TGTTGGTAAAGCCACCATGGAAGACGCCAAAAACATAAAGAAAGGCCCGGCGCCATTCTATC 

CGCTGGAAGATGGAACCGCTGGAGAGCAACTGCATAAGGCTATGAAQAGATACGCCCTGGTT 

CCTGGAACAATTGCTTTTACAGATGCACATATCGAGGTGGACATCACTTACGCTGAGTACTT 

CGAAATGTCCGTTCGGTTGGCAGAAGCTATGAAACGATATGGGCTGAATACAAATCACAGAA 

TCGTCGTATGCAGTGAAAACTCTCTTCAATTCTTTATGCCGGTGTTGGGCGCGTTATTTATC 

GGAGTTGCAGTTGCGCCCGCGAACGACATTTATAATGAACGTGAATTGCTCAACAGTATGGG 

CATTTCGCAGCCTACCGTGGTGTTCGTTTCCAAAAAGG6GTTGCAAAAAATTTTGAACGTGC 

AAAAAAAGCTCCCAATCATCCAAAAAATTATTATCATGGATTCTAAAACGGATTACCAGGGA 

TTTCAGTCGATGTACACGTTCGTCACATCTCATCTACCTCCCGGTTTTAATGAATACGATTT 

TGTGCCAGAGTCCTTCGATAGGGACAAGACAATTGCACTGATCATGAACTCCTCTGGATCTA 

CTGGTCTGCCTAAAGGTGTCGCTCTGCCTCATAGAACTGCCTGCGTGAGATTCTCGCATGCC 

AGAGATCCTATTTTTGGCAATCAAATGATTCCGGATACTGCGATTTTAAGTGTTGTTCCATT 

CCATCACGGTTTTGGAATGTTTACTACACTCGGATATTTGATATGTGGATTTCGAGTCGTCT 

TAATGTATAGATTTGAAGAAGAGCTGTTTCTGAGGAGCCTTCAGGATTACAAGATTCAAAGT 

GCGCTGCTGGTGCCAACCCTATTCTCCTTCTTCGCCAAAAGCACTCTGATTGACAAATACGA 

TTTATCTAATTTACACGAAATTGCTTCTGGTGGCGCTCCCCTCTCTAAGGAAGTCGGGGAAG 

CGGTTGCCAAGAGGTTCCATCTGCCAGGTATCAGGCAAGGATATGGGCTCACTGAGACTACA 

TCAGCTATTCTGATTACACCCGAGGGGGATGATAAACCGGGCGCGGTCGGTAAAGTTGTTCC 

ATTTTTTGAAGCGAAGGTTGTGGATCTGGATACCGGGAATy^CGCTGGGCGTTAATCAAAGAG 

GCGAACTGTGTGTGAGAGGTCCTATGATTATGTCCGGTTATGTAAACAATCCGGAAGCGACC 

AACGCCTTGATTGACAAGGATGGATGGCTACATTCTGGAGACATAGCTTACTGGGACGAAGA 

CGAACACTTCTTCATCGTTGACCGCCTGAAGTCTCTGATTAAGTACAAAGGCTATCAGGTGG 

CTCCCGCTGAATTGGAATCCATCTTGCTCCAACACCCCAACATCTTCGACGCAGGTGTCGCA 

GGTCTTCCCGACGATGACGCCGGTGAACTTCCCGCCGCCGTTGTTGTTTTGGAGCACGGAAA 

GACGATGACGGAAAAAGAGATCGTGGATTACGTCGCCAGTCAAGTAACAACCGCGAAAAAGT 

TGCGCGGAGGAGTTGTGTTTGTGGACGAAGTACCGAAAGGTCTTACCGGAAAACTCGACGCA 

AGAAAAATCAGAGAGATCCTCATAAAGGCCAAGAAGGGCGGAAAGATCGCCGTGTAATTCTA 

GAGTCGGGGCGGCCGGCCGCTTCGAGCAGACATGATAAGATACATTGATGAGTTTGGACAAA 

CCACAACTAGAATGCAGTGAAAAA7\ATGCTTTATTTGTGAAATTTGTGATGCTATTGCTTTA 

TTTGTAACCATTATAAGCTGCAATAAACAAGTTAACAACAACAATTGCATTCATTTTATGTT 

TCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTAAAGCAAGTAAAACCTCTACAAATGTGGTA 

AAATCGATAAGGATCAATTCGGCTTCGACTAGTACTGACGGACACACCGAAGCCCCGGCGGC 

AACCCTCAGCGGATGCCCCGGGGCTTCACGTTTTCCCAGGTCAGAAGCGGTTTTCGGGAGTA 

GTGCCCCAACTGGGGTAACCTTTGAGTTCTCTCAGTTGGGGGCGTAGGGTCGCCGACATGAC 

ACAAGGGGTTGTGACCGGGGTGGACACGTACGCGGGTGCTTACGACCGTCAGTCGCGCGAGC 

GCGACTAGTACAAGCCGAATTGATCCGTCGACCGATGCCCTTGAGAGCCTTCAACCCAGTCA 

GCTCCTTCCGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGACTGTCTTCTTTATC 

ATGCAACTCGTAGGACAGGTGCCGGCAGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGC 
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GCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCC 

ACAGAATC7VGGGGATAACGCAGGAAAGAACATGTGAGCy\AAAGGCCAGCAAAAGGCCAGGAA 

CCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACA 

AAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTT 

CCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTC 

CGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGTAGGTATCTCAGTT 

CGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGC 

TGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACT 

GGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCT 

TGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTG 

AAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGG 

TAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAG 

ATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATT 

TTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTT 

TAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTG 

AGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGAGTCCCCGTCGTG 

TAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGA 

CCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCA 

GAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGA 

GTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGT 

GTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTA 

CATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGA 

AGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGT 

CATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAAT 

AGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACAT 

AGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGAT 

CTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCAT 

CTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAG 

GGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAG 

CATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAAC 

AAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGCGCCCTGTAGCGGCGCA 

TTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGC 

GCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCAC6TTCGCCGGCTTTCGCCGTCAAG 

CTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAA 

AAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCC 

TTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCA 

ACCCTATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTA 

AAAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGTTTACAAT 

TTCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGC 

TATTACGCCAGCCGAAGCTACCATGATAAGTAAGTAATATTAAGGTACGGGAGGTACTTGGA 

GCGGCCGCAATAAAATATCTTTATTTTCATTACATCTGTGTGTTGGTTTTTTGTGTGAATCG 

ATAGTACTAACATACGCTCTCCATCAAAACAAAACGAAACyU\AACAAACTAGCAAAATAGGC 

TGTCCCCAGTGCAAGTGCA.GGTGCCAGAACATTT 
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pCMV-pur-attB (SEQ ID NO: 4) 

CTAGAGTCGGGGCGGCCGGCCGCTTCGAGCAGACATGATAAGATACATTGATGAGTTTGGAC 

AAACCACAACTAGAATGCAGTGAAAAAAATGCTTTATTTGTGAAATTTGTGATGCTATTGCT 

TTATTTGTAACCATTATAAGCTGCAATAAACAAGTTAACAACAACAATTGCATTCATTTTAT 

GTTTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTAAAGCAAGTAAAACCTCTACAAATGTG 

GTAAAATCGATAAGGATCAATTCGGCTTCAGGTACCGTCGACGATGTAGGTCACGGTCTCGA 

AGCCGCGGTGCGGGTGCCAGGGCGTGCCCTTGGGCTCCCCGGGCGCGTACTCCACCTCACCC 

ATCTGGTCCATCATGATGAACGGGTCGAGGTGGCGGTAGTTGATCCCGGCGAACGCGCGGCG 

CACCGGGAAGCCCTCGCCCTCGAAACCGCTGGGCGCGGTGGTCACGGTGAGCACGGGACGTG 

CGACGGCGTCGGCGGGTGCGGATACGCGGGGCAGCGTCAGCGGGTTCTCGACGGTCACGGCG 

6GCATGTCGACAGCCGAATTGATCCGTCGACCGATGCCCTTGAGAGCCTTCAACCCAGTCAG 

CTCCTTCCGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGACTGTCTTCTTTATCA 

TGCAACTCGTAGGACAGGTGCCGGCAGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCG 

CTCGGTCGTTCGGCTGCGGCGAGCGGTATCA6CTCACTCAAAGGCGGTAATACGGTTATCCA 

CS^GAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAAC 

CGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAA 

AAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTC 

CCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCC 

GCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGTAGGTATCTCAGTTC 

GGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCT 

GCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTQ 

GCavSCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTT 

GAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGA 

AGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGT 

AGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCa^GAAAAAAAGGATCTCAAGAAGA 

TCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTT 

TGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTT 

AAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGA 

GGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGT 

AGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGAC 

CCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAG 

AAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAG 

TAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTG 

TCACGCTCGTCGTTTGGTATGQCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTAC 

ATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAA 

GTAAGTTGGCCGCAGtGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTC 

ATGCCATCCGTAAGATGCTTTTCTGTGACTGGT6AGTACTCAACCAAGTCATTCTGAGAATA 

GTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATA 

GCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATC 

TTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATC 

TTTTACTTTCACCAGCGTTTCTGGGTGAGCyW^CAGGAAGGCAAAATGCCGCAAAAAAGG 

GAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGC 

ATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACA 

AATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCT^CCTGACGCGCCCTGTAGCGGCGCAT 

TAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCG 

CCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGC 

TCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGT6CTTTACGGCACCTCGACCCCAAAA 

AACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCT 

TTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAA 

CCCTATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAA 

AAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGTTTACAATT 

TCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCT 

ATTACGCCAGCCCAAGCTACCATGATAAGTAAGTAATATTAAGGTACGGGAGGTACTTGGAG 

CGGCCGCAATAAAATATCTTTATTTTCATTACATCTGTGTGTTGGTTTTTTGTGTGAATCGA 
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TAGTACTi^CATACGCTCTCCATCAAAACAAAACGAAACAAAACAAACTAGCAAAATAGGCT 

GTCCCCAGTGCT^GTGCAGGTGCCAGAACTVTTTCTCTATCGATAGGTACCGAGCTCTTACGC 

GTGCTAGCCCTCGAGCAGGATCTATACATTGAATCAATATTGGCAATTAGCCATATTAGTCA 

TTGGTTATATAGCATAAATCAATATTGGCTATTGGCCATTGCATACGTTGTATCTATATCAT 

AATATGTACATTTATATTGGCTCATGTCCAATATGACCGCCATGTTGACATTGATTATTGAC 

TAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCG 

TTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACG 

TCAATAATGACGTATGTTGCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGT 

GGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTCCGC 

CCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTA 

CGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCG 

GTTTTGGCAGTACATCAATGGGCiSTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCC 

ACCCCATTGACGTCAATGGGAOTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGT 

CGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATAT 

AAGCAGAGCTCGTTTAGTGAACCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTGACC 

TCCATAGAAGACACCGGGACCGATCCAGCCTCCCCTCGAAGCTCGACTCTAGGGGCTCGAGA 

TCTGCGATCTAAGTAAGCTTGCATGCCTGCAGGTCGGCCGCCACGACCGGTGCCGCCACCAT 

CCCCTGACCCACGCCCCTGACCCCTCACAAGGAGACGACCTTCCATGACCGAGTACAAGCCC 

ACGGTGCGCCTCGCCACCCGCGACGACGTCCCCCGGGCCGTACGCACCCTCGCCGCCGCGTT 

CGCCGACTACCCCGCCACGCGCCACACCGTCGACCCGGACCGCCACATCGAGCGGGTCACCG 

AGCTGCAAGAACTCTTCCTCACGCGCGTCGGGCTCGACATCGGCAAGGTGTGGGTCGCGGAC 

GACGGCGCCGCGGTGGCGGTCTGGACCACGCCGGAGAGCGTCGAAGCGGGGGCGGTGTTCGC 

CGAGATCGGCCCGCGCATGGCCGAGTTGAGCGGTTCCCGGCTGGCCGCGCAGCAACAGATGG 

AAGGCCTCCTGGCGCCGCACCGGCCCAAGGAGCCCGCGTGGTTCCTGGCCACCGTCGGCGTC 

TCGCCCGACCACCAGGGCAAGGGTCTGGGCAGCGCCGTCGTGCTCCCCGGAGTGGAGGCGGC 

CGAGCGCGCCGGGGTGCCCGCCTTCCTGGAGACCTCCGCGCCCCGCAACCTCCCCTTCTACG 

AGCGGCTCGGCTTCACCGTCACCGCCGACGTCGAGGTGCCCGAAGGACCGCGCACCTGGTGC 

ATGACCCGCAAGCCCGGTGCCTGACGCCCGCCCCACGACCCGCAGCGCCCGACCGAAAGGAG 

CGCACGACCCCATGGCTCCGACCGAAGCCGACCCGGGCGGCCCCGCCGACCCCGCACCCGCC 

CCCGAGGCCCACCGACT 
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pCMV-pur-attP (SEQ ID NO: 5) 

CTAGAGTCGGGGCGGCCGGCCGCTTCGAGCAGACATGATAAGATACATTGATGAGTTTGGAC 

AAACCACAACTAGAATGCAGTGAAAAAAATGCTTTATTTGTGAAATTTGTGATGCTATTGCT 

TTATTTGTAACCATTATAAGCTGCAATAAACAAGTTAACAACAACAATTGCATTCATTTTA 

GTTTCAGGTTO^GGGGGAGGTGTGGGAGGTTTTTTAAAGCyiAGTAAAACCTCTACAAATGTG 

GTAAAATCGATAAGGATCAATTCGGCTTCGACTAGTACTGACGGACACACCGAAGCCCCGGC 

GGCAACCCTCAGCGGATGCCCCGGGGCTTCACGTTTTCCCAGGTCAGAAGCGGTTTTCGGGA 

GTAGTGCCCCAACTGGGGTAACCTTTGAGTTCTCTCAGTTGGGGGCGTAGGGTCGCCGACAT 

GACACAAGGGGTTGTGACCGGGGTGGACACGTACGCGGGTGCTTACGACCGTCAGTCGCGCG 

AGCGCGACTAGTACAAGCCGAATTGATCCGTCGACCGATGCCCTTGAGAGCCTTCAACCCAG 

TCAGCTCCTTCCGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGACTGTCTTCTTT 

ATCATGCAACTCGTAGGACAGGTGCCGGCAGCGCTCTTCCGCTTCCTCGCTCACTGACTCGC 

TGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCI^AAGGCGGTAATAGGGTTA 

TCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAG 

GAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATC 

ACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAOATACCAGGCG 

TTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCT 

GTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGTAGGTATCTCA 

GTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGAC 

CGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCC 

ACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAQAGT 

TCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAA.GGACAGTATTTGGTATCTGCGCTCTG 

CTGAAGCCAGTTACCTTCGGAAAAAGA6TTGGTAGCTCTTGATCC6GCAAACAAACCACCGC 

TGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAG 

AAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGG 

ATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAG 

TTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCA 

GTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTC 

GTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCG 

AGACCCT^CGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGC 

GCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCT 

AGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGT 

GGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAG 

TTACATGATCCCCCATQTTGTGC2^AAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTC 

AGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTAC 

TGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAG 

AATAGTGTATGCGGCQACCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCA 

CATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAG 

GATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAG 

CATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAA 

AAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTG 

AAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATA 

AACAAATAGGGGTTCCGCGCACATTTCCCCGAAimGTGCCACCTGACGCGCCCTGTAGCGGC 

GCATTAAGCX3CGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCT 

AGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTC 

AAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGQCACCTCGACCCC 

AAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCG 

CCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACAG 

TCAACCCTATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGG 

TTAAAAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAPIlATATTAACGTTTAC 

aatttcccattcgccattcaggctgcgcaactgttgggaagggcgatcggtgcgggcctctt 
cgctattacgccagcccaagctaccatgataagtaagtaatattaaggtacgggaggtactt 
ggagcggccgcaataaaatatctttattttcattacatctgtgtgttggttttttgtgtgaa 
tcgatagtactaacatacgctctccatcaaaacaaaacgaaacaaaacaaactagcaaaata 
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GGCTGTCCCCAGTGCAAGTGCAGGTGCCAGAACATTTCTCTATCGATAGGTACCGAGCTCTT 

ACGCGTGCTAGCCCTCGAGCAGGATCTATACATTGAATCAATATTGGCAATTAGCCATATTA 

GTCATTGGTTATATAGCATAAATCAATATTGGCTATTGGCCATTGCATACGTTGTATCTATA 

TCATAATATGTACATTTATATTGGCTCATGTCCAATATGACCGCCATGTTGACATTGATTAT 

TGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTC 

CGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATT 

GACGTCJ^TAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAAT 

GGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGT 

CCGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGAC 

CTTACGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGA 

TGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGT 

CTGCACCCCATTGACGTGAATGGGAGTTTGTTTTGGCACCAAAATO^ACGGGACTTTCCAAA 

ATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCT 

ATATAAGCAGAGCTCGTTTAGTGAACCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTT 

GACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCCCTCGAAGCTCGACTCTAGGGGCTC 

GAGATCTGCGATCTAAGTAAGCTTGCATGCCTGCAGGTCGGCCGCCACGACCGGTGCCGCCA 

CCATCCCCTGACCCACGCCCCTGACCCCTCACAAGGAGACGACCTTCCATGACCGAGTACAA 

GCCCACGGTGCGCCTCGCCACCCGCGACGACGTCCCCCGGGCCGTACGCACCCTCGCCGCCG 

CGTTCGCCGACTACCCCGCCACGCGCCACACCGTCGACCCGGACCGCCACATCGAGCGGGTC 

ACCGAGCTGCAAGAACTCTTCCTCACGCGCGTCGGGCTCGACATCGGCAAGGTGTGGGTCGC 

GGACGACGGCGCCGCGGTGGCGGTCTGGACCACGCCGGAGAGCGTCGAAGCGGGGGCGGTGT 

TCGCCGAGATCGGCCCGCGCATGGCCGAGTTGAGCGGTTCCCGGCTGGCCGCGCAGCAACAG 

ATGGAAGGCCTCCTGGCGCCGCACCGGCCCAAGGAGCCCGCGTGGTTCCTGGCCACCGTCGG 

CGTCTCGCCCGACCACCAGGGCAAGGGTCTGGGCAGCGCCGTCGTGCTCCCCGGAGTGGAGG 

CGGCCGAGCGCGCCGGGGTGCCCGCCTTCCTGGAGACCTCCGCGCCCCGCAACCTCCCCTTC 

TACGAGCGGCTCGGCTTCACCGTCACCGCCGACGTCGAGGTGCCCGAAGGACCGCGCACCTG 

GTGCATGACCCGCAAGCCCGGTGCCTGACGCCCGCCCCACGACCCGCAGCGCCCGACCGAAA 

GGAGCGCACGACCCCATGGCTCCGACCGAAGCCGACCCGGGCGGCCCCGCCGACCCCGCACC 

CGCCCCCGAGGCCCACCGACT 
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pCMV-EGFP-attB (SEQ ID NO: 6) 

CTAGAGTCGGGGCGGCCGGCCGCTTCGAGCAGACATGATAAGATACATTGATGAGTTTGGAC 

AAACCACAACTAGAATGCAGTGAAAAAAATGCTTTATTTGTGAAATTTGTGATGCTATTGCT 

TTATTTGTAACCATTATAAGCTGCAATAAACAAGTTAACAACAACAATTGGATTCATTTTAT 

GTTTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTAAAGCAAGTAAAACCTCTACAAATGTG 

GTAAAATCGATAAGGATCAATTCGGCTTCAGGTACCGTCGACGATGTAGGTCACGGTCTCGA 

AGCCGCGGTGCGGGTGCCAGGGCGTGCCCTTGGGCTCCCCGGGCGCGTACTCCACCTCACCC 

ATCTGGTCCATCATGATGAACGGGTCGAGGTGGCGGTAGTTGATCCCGGCGAACGCGCGGCG 

CACCGGGAAGCCCTCGCCCTCGAAACCGCTGGGCGCGGTGGTCACGGTGAGCACGGGACGTG 

CGACGGCGTCGGCGGGTGCGGATACGCGGGGCAGCGTCAGCGGGTTCTCGACGGTCACGGGG 

GGCATGTCGACAGCCGAATTGATCCGTCGACCGATGCCCTTGAGAGCCTTCAACCCAGTCAG 

CTCCTTCCGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGACTGTCTTCTTTATCA 

TGCAACTCGTAGGACAGGTGCCGGCAGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCG 

CTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGT?iLATACGGTTATCCA 

CAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAAC 

CGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTGCGCCGCCCTGACGAGCATCACAA 

AAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTC 

CCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCC 

GCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGTAGGTATCTCA6TTC 

GGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCT 

GCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTG 

GCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTT 

GAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGA 

AGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGT 

AGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGA 

TCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTT 

TGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTT 

AAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGA 

GGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGT 

AGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGAC 

CCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAG 

AAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAG 

TAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTG 

TCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTAC 

ATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAA 

GTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTC 

ATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATA 

GTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATA 

GCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATC 

TTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATC 

TTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGG 

GAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGC 

ATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACA 

AATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGCGCCCTGTAGCGGCGCAT 

TAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCG 

CCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGC 

TCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAA 

AACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCT 

TTGACGTTGGAGTCCa.CGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAA 

CCCTATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAA 

AAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGTTTACAATT 

TCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCGTCTTCGCT 

ATTACGCCAGCCCAAGCTACCATGATAAGTAAGTAATATTAAGGTACGGGAGGTACTTGGAG 

CGGCCGCAATAAAATATCTTTATTTTCATTACATCTGTGTGTTGGTTTTTTGTGTGAATCGA 



wo 2004/092351 



20/87 



PCT/US2004/009253 



TAGTACTAA.CATACGCTCTCCATCAAAACAAAACGAAACAAAACAAACTAGCAAAATAGGCT 
GTCCCCAGTGCAAGTGCAGGTGCCAGAACATTTCTCTATCGATAGGTACCGAGCTCTTACGC 
GTGCTAGCCCTCGAGCAGGATCTATACATTGAATCAATATTGGCAATTAGCCATATTAGTCA 
TTGGTTATATAGCATAAATCAATATTGGCTATTGGCCATTGCATACGTTGTATCTATATCAT 
AATATGTACATTTATATTGGCTCATGTCCAATATGACCGCCATGTTGACATTGATTATTGAC 
TAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCG 
TTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACG 
TCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGT 
GGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTCCGC 
CCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTA 
CGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCG 
GTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCC 
ACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGT 
CGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATAT 
AAGCAGAGCTCGTTTAGTGAACCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTGACC 
TCCATAGAAGACACCGGGACCGATCCAGCCTCCCCTCGAAGCTCGACTCTAGGGGCTCGAGA 
TCCCCGGGTACCGGTCGCCACCATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGC 
CCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGGGC 
GAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCC 
CGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACC 
CCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAG 
CGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGG 
CGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCC 
TGGGGCACAAGCTGGAGTACAACTAC^CAGCCACAACGTCTATATC^TGGCCGACAAGCAG 
AAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCT 
CGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACC 
ACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTC 
CTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAAAG 
CGGCCGCTCGAGCATGCAT 



Fig. 14 



7 



wo 2004/092351 



21/87 



PCT/US2004/009253 



p-12.0-lys-LSPIFNMM-CMV-pur-attB (SEQ ID NO: 7) 

GGGCTGCAGGAATTCGATTGCCGCCTTCTTTGATATTCACTCTGTTGTATTTCATCTCTTCT 
TGCCGATGAAAGGATATAACAGTCTGTATAACAGTCTGTGAGGAAATACTTGGTATTTCTTC 
TGATCAGTGTTTTTATAAGTAATGTTGAATATTGGATAAGGCTGTGTGTCCTTTGTCTTGGG 
AGACAAAGCCCACAGCAGGTGGTGGTTGGGGTGGTGGCAGCTCAGTGACAGGAGAGGTTTTT 
TTGCCTGTTTTTTTTTTTTTTTTTTTTTTTAAGTAAGGTGTTCTTTTTTCTTAGTAAATTTT 
CTACTGGACTGTATGTTTTGACAGGTCAGAAACATTTCTTCAAAAGAAGAACCTTTTGGAAA 
CTGTACAGCCCTTTTCTTTCATTCCCTTTTTGCTTTCTGTGCCAATGCCTTTGGTTCTGATT 
GCATTATGGAAAACGTTGATCGGAACTTGAGGTTTTTATTTATAGTGTGGCTTGAAAGCTTG 
GATAGCTGTTGTTACACGAGATACCTTATTAAGTTTAGGCCAGCTTGATGCTTTATTTTTTC 
CCTTTGAAGTAGTGAGCGTTCTCTGGTTTTTTTCCTTTGAAACTGGTGAGGCTTAGATTTTT 
CTAATGGGATTTTTTACCTGATGATCTAGTTGCATACCCAAATGCTTGTAAATGTTTTCCTA 
GTTAACATGTTGATAACTTCGGATTTACATGTTGTATATACTTGTCATCTGTGTTTCTAGTA 
AAAATATATGGCATTTATAGAAATACGTAATTCCTGATTTCCTTTTTTTTTATCTCTATGCT 
CTGTGTGTACAGGTCAAACAGACTTCACTCCTATTTTTATTTATAGAATTTTATATGCAGTC 
TGTCGTTGGTTCTTGTGTTGTAAGGATACAGCCTTAAATTTCCTAGAGCGATGCTCAGTAAG 
GCGGGTTGTCACATGGGTTCAAATGTAAAACGGGCACGTTTGGCTGCTGCCTTCCCGAGATC 
CAGGACACTAAACTGCTTCTGCACTGAGGTATAAATCGCTTCAGATCCCAGGGAAGTGCAGA 
TCCACGTGCATATTCTTAAAGAAGAATGAATACTTTCTAAAATATTTTGGCATAGGAAGCAA 
GCTGCATGGATTTGTTTGGGACTTAAATTATTTTGGTAACGGAGTGCATAGGTTTTAAACAC 
AGTTGCAGCATGCTAACGAGTCACAGCGTTTATGCAGAAGTGATGCCTGGATGCCTGTTGCA 
GCTGTTTACGGCACTGCCTTGCAGTGAGCATTGCAGATAGGGGTGGGGTGCTTTGTGTCGTG 
TTCCCACACGCTGCCACACAGCCACCTCCCGGAACACATCTCACCTGCTGGGTACTTTTCAA 
ACCATCTTAGCAGTAGTAGATGAGTTACTATGAAACAGAGAAGTTCCTCAGTTGGATATTCT 
CATGGGATGTCTTTTTTCCCATGTTGGGCAAAGTATGATAAAGCATCTCTATTTGTAAATTA 
TGCACTTGTTAGTTCCTGAATCCTTTCTATAGCACCACTTATTGCAGCAGGTGTAGGCTCTG 
GTGTGGCCTGTGTCTGTGCTTCAATCTTTTAAAGCTTCTTTGGAAATACACTGACTTGATTG 
AAGTCTCTTGAAGATAGTAAACAGTACTTACCTTTGATCCCAATGAAATCGAGCATTTCAGT 
TGTAAAAGAATTCCGCCTATTCATACCATGTAATGTAATTTTACACCCCCAGTGCTGACACT 
TTGGAATATATTCAAGTAATAGACTTTGGCCTCACCCTCTTGTGTACTGTATTTTGTAATAG 
AAAATATTTTAAACTGTGCATATGATTATTACATTATGAAAGAGACATTCTGCTGATCTTCA 
AATGTAAGAAAATGAGGAGTGCGTGTGCTTTTATAAATACAAGTGATTGCAAATTAGTGCAG 
GTGTCCTTAAAAAAAAAAAAAAAAAGTAATATAAAAAGGACCAGGTGTTTTACAAGTGAAAT 
ACATTCCTATTTGGTAAACAGTTACATTTTTATGAAGATTACCAGCGCTGCTGACTTTCTAA 
ACATAAGGCTGTATTGTCTTCCTGTACCATTGCATTTCCTCATTCCCAATTTGCACAAGGAT 
GTCTGGGTAAACTATTCAAGAAATGGCTTTGAAATACAGCATGGGAGCTTGTCTGAGTTGGA 
ATGCAGAGTTGCACTGCAAAATGTCAGGAAATGGATGTCTCTCAGAATGCCCAACTCCAAAG 
GATTTTATATGTGTATATAGTAAGCAGTTTCCTGATTCCAGCAGGCCAAAGAGTCTGCTGAA 
TGTTGTGTTGGCGGAGAGCTGTATTTCTCAACAAGGTAAGATGGTATCCTAGCAACTGCGGA 
TTTTAATACATTTTCAGCAGAAGTACTTAGTTAATCTCTACCTTTAGGGATCGTTTCATCAT 
TTTTAGATGTTATACTTGAAATACTGCATAACTTTTAGCTTTCATGGGTTCCTTTTTTTCAG 
CCTTTAGGAGACTGTTAAGCAATTTGCTGTCCAACTTTTGTGTTGGTCTTAAACTGCAATAG 
TAGTTTACCTTGTATTGAAGAAATAAAGACCATTTTTATATTAAAAAATACTTTTGTCTGTC 
TTCATTTTGACTTGTCTGATATCCTTGCAGTGCCCATTATGTCAGTTCTGTCAGATATTCAG 
ACATCAAAACTTAACGTGAGCTCAGTGGAGTTACAGCTGCGGTTTTGATGCTGTTATTATTT 
CTGAAACTAGAAATGATGTTGTCTTCATCTGCTCATCAAACACTTCATGCAGAGTGTAAGGC 
TAGTGAGAAATGCATACATTTATTGATACTTTTTTAAAGTCAACTTTTTATCAGATTTTTTT 
TTCATTTGGAAATATATTGTTTTCTAGACTGCATAGCTTCTGAATCTGAAATGCAGTCTGAT 
TGGCATGAAGAAGCACAGCACTCTTCATCTTACTTAAACTTCATTTTGGAATGAAGGAAGTT 
AAGCAAGGGCACAGGTCCATGAAATAGAGACAGTGCGCTCAGGAGAAAGTGAACCTGGATTT 
CTTTGGCTAGTGTTCTAAATCTGTAGTGAGGAAAGTAACACCCGATTCCTTGAAAGGGCTCC 
AGCTTTAATGCTTCCAA?VTTGAAGGTGGCAGGCAACTTGGCCACTGGTTATTTACTGCATTA 
TGTCTCAGTTTCGCAGCTAACCTGGCTTCTCCACTATTGAGCATGGACTATAGCCTGGCTTC 
AGAGGCCAGGTGAAGGTTGGGATGGGTGGAAGGAGTGCTGGGCTGTGGCTGGGGGGACTGTG 
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GGGACTCCAAGCTGAGCTTGGGGTGGGCAGCACAGGGAAAAGTGTGGGTAACTATTTTTAAG 
TACTGTGTTGCAAACGTCTCATCTGCAAATACGTAGGGTGTGTACTCTCGAAGATTAACAGT 
GTGGGTTCAGTAATATATGGATGAATTCACAGTGGAAGCATTCAAGGGTAGATCATCTAACG 
ACACCAGATCATCAAGCTATGATTGGAAGCGGTATCAGAAGAGCGAGGAAGGTAAGCAGTCT 
TCATATGTTTTCCCTCCACGTAAAGCAGTCTGGGAAAGTAGCACCCCTTGAGCAGAGACAAG 
GAAATAATTCAGGAGCATGTGCTAGGAGAACTTTCTTGCTGAATTCTACTTGCAAGAGCTTT 
GATGCCTGGCTTCTGGTGCCTTCTGCAGCACCTGCAAGGCCCAGAGCCTGTGGTGAGCTGGA 
GGGAAAGATTCTGCTCAAGTCCAAGCTTCAGCAGGTCATTGTCTTTGCTTCTTCCCCCAGCA 
CTGTGCAGCAGAGTGGAACTGATGTCGAAGCCTCCTGTCCACTACCTGTTGCTGCAGGCAGA 
CTGCTCTCAGAAAAAGAGAGCTAACTCTATGCCATAGTCTGAAGGTAAAATGGGTTTTAAAA 
AAGAAAACACAAAGGCAAAACCGGCTGCCCCATGAGAAGAAAGCAGTGGTAAACATGGTAGA 
AAAGGTGCAGAAGCCCCCAGGCAGTGTGACAGGCCCCTCCTGCCACCTAGAGGCGGGAACAA 
GCTTCCCTGCCTAGGGCTCTGCCCGCGAAGTGCGTGTTTCTTTGGTGGGTTTTGTTTGGCGT 
TTGGTTTTGAGATTTAGACACAAGGGAAGCCTGAAAGGAGGTGTTGGGCACTATTTTGGTTT 
GTAAAGCCTGTACTTCAAATATATATTTTGTGAGGGAGTGTAGCGAATTGGCCAATTTAAAA 
TAAAGTTGCAAGAGATTGAAGGCTGAGTAGTTGAGAGGGTAACACGTTTAATGAGATCTTCT 
GAAACTACTGCTTCTAAACACTTGTTTGAGTGGTGAGACCTTGGATAGGTGAGTGCTCTTGT 
TACATGTCTGATGCACTTGCTTGTCCTTTTCCATCCACATCCATGCATTCCACATCCACGCA 
TTTGTCACTTATCCCATATCTGTCATATCTGACATACCTGTCTCTTCGTCACTTGGTCAGAA 
GAAACAGATGTGATAATCCCCAGCCGCCCCAAGTTTGAGAAGATGGCAGTTGCTTCTTTCCC 
TTTTTCCTGCTAAGTAAGGATTTTCTCCTGGCTTTGACACCTCACGAAATAGTCTTCCTGCC 
TTACATTCTGGGCATTATTTCAAATATCTTTGGAGTGCGCTGCTCTCAAGTTTGTGTCTTCC 
TACTCTTAGAGTGAATGCTCTTAGAGTGAAAGAGAAGGAAGAGAAGATGTTGGCCGCAGTTC 
TCTGATGAACACACCTCTGAATAATGGCCAAAGGTGGGTGGGTTTCTCTGAGGAACGGGCAG 
CGTTTGCCTCTGAAAGCAAGGAGCTCTGCGGAGTTGCAGTTATTTTGCAACTGATGGTGGAA 
CTGGTGCTTAAAGCAGATTCCCTAGGTTCCCTGCTACTTCTTTTCCTTCTTGGCAGTCAGTT 
TATTTCTGACAGACAAACAGCCACCCCCACTGCAGGCTTAGAAAGTATGTGGCTCTGCCTGG 
GTGTGTTACAGCTCTGCCCTGGTGAAAGGGGATTAAAACGGGCACCATTCATCCCAAACAGG 
ATCCTCATTCATGGATCAAGCTGTAAGGAACTTGGGCTCCAACCTCAAAACATTAATTGGAG 
TACGAATGTAATTAAAACTGCATTCTCGGATTCCTAAGTCATTTAGTCTGGACTCTGCAGCA 
TGTAGGTCGGCAGCTCCCACTTTCTCAAAGACCACTGATGGAGGAGTAGTAAAAATGGAGAC 
CGATTCAGAACAACCAACGGAGTGTTGCCGAAGAAACT6ATGGAAATAATGCATGAATTGTG 
TGGTGGACATTTTTTTTAAATACATAAACTACTTCAAATGAGGTCGGAGAAGGTCAGTGTTT 
TATTAGCAGCCATAAAACCAGGTGAGCGAGTACCATTTTTCTCTACAAGAAAAACGATTCTG 
AGCTCTGCGTAAGTATAAGTTCTCCATAGCGGCTGAAGCTCCCCCCTGGCTGCCTGCCATCT 
CAGCTGGAGTGCAGTGCCATTTCCTTGGGGTTTCTCTCACAGCAGTAATGGGACAATACTTC 
ACAAAAATTCTTTCTTTTCCTGTCATGTGGGATCCCTACTGTGCCCTCCTGGTTTTACGTTA 
CCCCCTGACTGTTCGATTCIAGCGGTTTGGAAAGAGAAAAAGAATTTGGAAATAAAACATGTC 
TACGTTATCACCTCCTCCAGCATTTTGGTTTTTAATTATGTCAATAACTGGCTTAGATTTGG 
AAATGAGAGGGGGTTGGGTGTATTACCGAGGAACAAAGGAAGGCTTATATAAACTCAAGTCT 
TTTATTTAGAGAACTGGCAAGCTGTCAAAAACAAAAAGGCCTTACCACCAAATTAAGTGAAT 
AGCCGCTATAGCCAGCAGGGCCAGCACGAGGGATGGTGCACTGCTGGCACTATQCCACGGCC 
TGCTTGTGACTCTGAGAGCAACTGCTTTGGATy^TGACAGCACTTGGTGCAATTTCCTTTGTT 
TCAGAATGCGTAGAGCGTGTGCTTGGCGACAGTTTTTCTAGTTAGGCCACTTCTTTTTTCCT 
TCTCTCCTCATTCTCCTAAGCATGTCTCCATGCTGGTAATCCCAGTCAAGTGAACGTTCAAA 
CAATGAATCCATCACTGTAGGATTCTCGTGGTGATCAAATCTTTGTGTGAGGTCTATAAAAT 
ATGGAAGCTTATTTATTTTTCGTTCTTCCATATCAGTCTTCTCTATGACAATTCACATCCAC 
CACAGCAAATTAAAGGTGAAGGAGGCTGGTGGGATGAAGAGGGTCTTCTAGCTTTACGTTCT 
TCCTTGCAAGGCCACAGGAAAATGCTGAGAGCTGTAGAATACAGCCTGGGGTAAGAAGTTCA 
GTCTCCTGCTGGGACAGCTAACCGCATCTTATAACCCCTTCTGAGACTCATCTTAGGACCAA 
ATAGGGTCTATCTGGGGTTTTTGTTCCTGCTGTTCCTCCTGGAAGGCTATCTCACTATTTCA 
CTGCTCCCACGGTTACAAACCAAAGATACAGCCTGAATTTTTTCTAGGCCACATTACATAAA 
TTTGACCTGGTACCAATATTGTTCTCTATATAGTTATTTCCTTGCCCACTGTGTTTAAGCCC 
TTAAGGCATTCAGAACAACTAGAATCATAGAATGGTTTGGATTGGAAGGGGCCTTAAACATC 
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ATCCATTTCCAACCCTCTGCCATGGGCTGCTTGCGACCCACTGGCTCAGGCTGCCCAGGGCC 
CCATCCAGCCTGGCCTTGAGCACCTCCAGGGATGGGGCACCCACAGCTTCTCTGGGCAGCCT 
GTGCCAACACCTCACCACTCTCTGGGTAAAGAATTCTCTTTTAACATCTAATCTAAATCTCT 
TCTCTTTTAGTTTAAAGCCATTCCTCTTTTTCCCGTTGCTATCTGTCCAAGAAATGTGTATT 
GGTCTCCCTCCTGCTTATAAGCAGGAAGTACTGGAAGGCTGCAGTGAGGTCTCCCCAGAGCC 
TTCTCTTCTCCAGGCTGAACAAGCCCAGCTCCTTCAGCCTGTCTTCGTAGGAGATCATGTTA 
GTGGCCCTCCTCTGGACCCATTCCAACAGTTCCACGGCTTTCTTGTGGAGCCCCAGGTCTGG 
ATGCAGTACTTCAGATGGGGCCTTACAAAGGCAGAGCAGATGGGGACAATCGCTTACCCCTC 
CCTGCTGGCTGCCCCTGTTTTGATGCAGCCCAGGGTACTGTTGGCCTTTCAGGCTCCCAGAC 
CCCTTGCTGATTTGTGTCAAGCTTTTCATCCACCAGAACCCACGCTTCCTGGTTAATACTTC 
TGCCCTCACTTCTGTAAGCTTGTTTCAGGAGACTTCCATTCTTTAGGACAGACTGTGTTACA 
CCTACCTGCCCTATTCTTGCATATATACATTTCAGTTCATGTTTCCTGTAACAGGACAGAAT 
ATGTATTCCTCTAACAAAAATACATGCAGAATTCCTAGTGCCATCTCAGTAGGGTTTTCATG 
GCAGTATTAGCACATAGTCAATTTGCTGCAAGTAGCTTCCAAGCTGCGGCCTCCCATAAATC 
CTGTATTTGGGATCAGTTACCTTTTGGQGTAAGCTTTTGTATCTGCAGAGACCCTGGGGGTT 
CTGATGTGCTTCAGCTCTGCTCTGTTCTGACTGCACCATTTTCTAGATCACCCAGTTGTTCC 
TGTACAACTTCCTTGTCCTCCATCCTTTCCCAGCTTGTATCTTTGACAAATACAGGCCTATT 
TTTGTGTTTGCTTCAGCAGCCATTTAATTCTTCAGTGTCATCTTGTTCTGTTGATGCCACTG 
GAACAGGATTTTCAGCAGTCTTGCAAAGAACATCTAGCTGAAAACTTTCTGCCATTCAATAT 
TCTTACCAGTTCTTCTTGTTTGAGGTGAGCCATAAATTACTAGAACTTCGTCACTGACAAGT 
TTATGCATTTTATTACTTCTATTATGTACTTACTTTGACATAACACAGACACGGACATATTT 
TGCTGGGATTTCCACAGTGTCTCTGTGTCCTTCACATGGTTTTACTGTCATACTTCCGTTAT 
AACCTTGGCAATCTGCCCAGCTGCCCATCACAAGAAAAGAGATTCCTTTTTTATTACTTCTC 
TTCAGCCAATAAACAAAATGTGAGAAGCCCAAACAAGAACTTGTGGGGCAGGCTGCCATCAA 
GGGAGAGACAGCTGAAGGGTTGTGTAGCTCAATAGAATTAAGAAATAATAAAGCTGTGTCAG 
ACAGTTTTGCCTGATTTATACAGGCACGCCCCAAGCCAGAGAGGCTGTCTGCCAAGGCCACC 
TTGCAGTCCTTGGTTTGTAAGATAAGTCATAGGTAACTTTTCTGGTGAATTGCGTGGAGAAT 
CATGATGGCAGTTCTTGCTGTTTACTATGGTAAGATGCTAAAATAGGAGACAGCAAAGTAAC 
ACTTGCTGCTGTAGGTGCTCTGCTATCCAGACAGCGATGGCACTCGCACACCAAGATGAGGG 
ATGCTCCCAGCTGACGGATGCTGGGGCAGTAACAGTGGGTCCCATGCTGCCTGCTCATTAGC 
ATCACCTCAGCCCTCACCAGCCCATCAGAAGGATCATCCCAAGCTGAGGAAAGTTGCTCATC 
TTCTTCACATCATCAAACCTTTGGCCTGACTGATGCCTCCCGGATGCTTAAATGTGGTCACT 
GACATCTTTATTTTTCTATGATTTCAAGTCAGAACCTCCGGATCAGGAGGGAACACATAGTG 
GGAATGTACCCTCAGCTCCAAGGCCAGATCTTCCTTCAATGATCATGCATGCTACTTAGGAA 
GGTGTGTGTGTGTGAATGTAGAATTGCCTTTGTTATTTTTTCTTCCTGCTGTCAGGAACATT 
TTGAATACCAGAGAAAAAGAAAAGTGCTCTTCTTGGCATGGGAGGAGTTGTCACACTTGCAA 
AATAAAGGATGCAGTCCCAAATGTTCATAATCTCAGGGTCTGAAGGAGGATCAGAAACTGTG 
TATACAATTTCAGGCTTCTCTGAATGCAGCTTTTGAAAGCTGTTCCTGGCCGAGGCAGTACT 
AGTCAGAACCCTCGGAAACAGGAAGAAATGTCTTCAAGGTGCAGCAGGAGGAAACACCTTGC 
CCATCATGAAAGTGAATAACCACTGCCGCTGAAGGAATCCAGCTCCTGTTTGAGCAGGTGCT 
GCACACTCCCACACTGAAACAACAGTTCATTTTTATAGGACTTCCAGGAAGGATCTTCTTCT 
TAAGCTTCTTAATTATGGTACATCTCCAGTTGGCAGATGACTATGACTACTGACAGGAGAAT 
GAGGAACTAGCTGGGAATATTTCTGTTTGACCACCATGGAGTCACCCATTTCTTTACTGGTA 
TTTGGAAATAATAATTCTGAATTGCAAAGCAGGAGTTAGCGAAGATCTTCATTTCTTCCATG 
TTGGTGACAGCACAGTTCTGGCTATGAAAGTCTGCTTACAAGGAAGAGGATAAAAATCATAG 
GGATAATAAATCTAAGTTTGAAGACAATGAGGTTTTAGCTGCATTTGACATGAAGAAATTGA 
GACCTCTACTGGATAGCTATGGTATTTACGTGTCTTTTTGCTTAGTTACTTATTGACCCCAG 
CTGAGGTCAAGTATGAACTCAGGTCTCTCGGGCTACTGGCATGGATTGATTACATACAACTG 
TAATTTTAGCAGTGATTTAGGGTTTATGAGTACTTTTGCAGTAAATCATAGGGTTAGTAATG 
TTAATCTCAGGGAAAAAAAAAAAAAGCCAACCCTGACAGACATCCCAGCTCAGGTGGAAATC 
AAGGATCACAGCTCAGTGCGGTCCCAGAGAACACAGGGACTCTTCTCTTAGGACCTTTATGT 
ACAGGGCCTCAAGATAACTGATGTTAGTCAGAAGACTTTCCATTCTGGCCACAGTTCAGCTG 
AGGCAATCCTGGAATTTTCTCTCCGCTGCACAGTTCCAGTCATCCCAGTTTGTACAGTTCTG 
GCACTTTTTGGGTCAGGCCGTGATCCAAGGAGCAGAAGTTCCAGCTATGGTCAGGGAGTGCC 
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TGACCGTCCCAACTCACTGCACTCAAACAAAGGCGAAACCACAAGAGTGGCTTTTGTTGAAA 
TTGCAGTGTGGCCCAGAGGGGCTGCACCAGTACTGGATTGACCACGAGGCAACATTAATCCT 
CAGCAAGTGCAATTTGCAGCCATTAAATTGAACTAACTGATACTACAATGCAATCAGTATCA 
ACAAGTGGTTTGGCTTGGAAGATG6AGTCTAGGGGCTCTACAGGAGTAGCTACTCTCTAATG 
GAGTTGCATTTTGAAGCAGGACACTGTGAAAAGCTGGCCTCCTAAAGAGGCTGCTAAACATT 
AGGGTCAATTTTCCAGTGCACTTTCTGAAGTGTCTGCAGTTCCCCATGCAAAGCTGCCCAAA 
CATAGCACTTCCAATTGAATACAATTATATGCAGGCGTACTGCTTCTTGCCAGCACTGTCCT 
TCTCAAATGAACTCAACAAACAATTTCAAAGTCTAGTAGAAAGTAACAAGCTTTGAATGTCA 
TTAAAAAGTATATCTGCTTTCAGTAGTTCAGCTTATTTATGCCCACTAGAAACATCTTGTAC 
AAGCTGAACACTGGGGCTCCAGATTAGTGGTAAAACCTACTTTATACAATCATAGAATCATA 
GAATGGCCTGGGTTGGAAGGGACCCCAAGGATCATGAAGATCCAACACCCCCGCCACAGGCA 
GGGCCACCAACCTCCAGATCTGGTACTAGACCAGGCAGCCCAGGGCTCCATCCAACCTGGCC 
ATGAACACCTCCAGGGATGGAGCATCCACAACCTCTCTGGGCAGCCTGTGCCAGCACCTCAC 
CACCCTCTCTGTGAAGAACTTTTCCCTGACATCCAATGTAAGCCTTCCCTCCTTGAGGTTAG 
ATCCACTCCCCCTTGTGCTATCACTGTCTACTCTTGTAAAAAGTTGATTCTCCTCCTTTTTG 
GAAGGTTGCAATGAGGTCTCCTTGCAGCCTTCTTCTCTTCTGCAGGATGAACAAGCCCAGCT 
CCCTCAGCCTGTCTTTATAGGAGAGGTGCTCCAGCCCTCTGATGATCTTTGTGGCCCTCCTC 
TGGACCCGCTCCAAGAGCTCCACATCTTTCCTGTACTGGGGGCCCCAGGCCTGAATGCAGTA 
CTCCAGATGGGGCCTCAAAAGAGCAGAGTAAAGAGGGACAATCACCTTCCTCACCCTGCTGG 
CCAGCCCTCTTCTGATGGAGCCCTGGATACAACTGGCTTTCTGAGCTGCAACTTCTCGTTAT 
CAGTTCCACTATTAAAACAGGAACAATACAACAGGTGCTGATGGCCAGTGCAGAGTTTTTCA 
CACTTCTTCATTTCGGTAGATCTTAGATGAGGAACGTTGAAGTTGTGCTTCTGCGTGTGCTT 
CTTCCTCCTCAAATACTCCTGCCTGATACCTCACCCCACCTGCCACTGAATGGCTCCATGGC 
CCCCTGCAGCCAGGGCCCTGATGAACCCGGCACTGCTTCAGATGCTGTTTAATAGCACAGTA 
TGACCAAGTTGCACCTATGAATACACAAACAATGTGTTGCATCCTTCAGCACTTGAGAAGAA 
GAGCCAAATTTGCATTGTCAGGAAATGGTTTAGTAATTCTGCCAATTAAAACTTGTTTATCT 
ACCATGGCTGTTTTTATGGCTGTTAGTAGTGGTACACTGATGATGAACAATGGCTATGCAGT 
AAAATCAAGACTGTAGATATTGCAACAGACTATAAAATTCCTCTGTGGCTTAGCCAATGTGG 
TACTTCCCACATTGTATAAGAAATTTGGCAAQTTTAGAGCAATGTTTGAAGTGTTGGGAAAT 
TTCTGTATACTCAAGAGGGCGTTTTTGACAACTGTAGAACAGAGGAATCAAAAGGGGGTGGG 
AGGAAGTTAAAAGAAGAGGCAGGTGCAAGAGAGCTTGCAGTCCCGCTGTGTGTACGACACTG 
GCAACATGAdGTCTTTGCTAATCTTGGTGCTTTGCTTCCTGCCCCTGGCTGCCTTAGGGTGC 
GATCTGCCTCAGACCCACAGCCTGGGCAGCAGGAGGACCCTGATGCTGCTGGCTCAGATGAG 
GAGAATCAGCCTGTTTAGCTGCCTGAAGGATAGGCACGATTTTGGCTTTCCTCAAGAGGAGT 
TTGGCAACCAGTTTCAGAAGGCTGAGACGATCCCTGTGCTGCACGAGATGATCCAGCAGATC 
TTTAACCTGTTTAGCACCAAGGATAGCAGCGCTGCTTGGGATGAGACCCTGCTGGATAAGTT 
TTACACCGAGCTGTACCAGCAGCTGAACGATCTGGAGGCTTGCGTGATCCAGGGCGTGGGCG 
TGACCGAGACCCCTCTGATGAAGGAGGATAGCATCCTGGCTGTGAGGAAGTACTTTCAGAGG 
ATCACCCTGTACCTGAAGGAGAAGAAGTACAGCCCCTGCGCTTGGGAAGTCGTGAGGGCTGA 
GATCATGAGGAGCTTTAGCCTGAGCACCAACCTGCAAGAGAGCTTGAGGTCTAAGGAGTAAA 
AAGTCTAGAGTCGGGGCGGCCGGCCGCTTCGAGCAGACATGATAAGATACATTGATGAGTTT 
GGACAAACCACAACTAGAATGCAGTGAAAAAAATGCTTTATTTGTGAAATTTGTGATGCTAT 
TGCTTTATTTGTAACCATTATAAGCTGCAATAAACAAGTTAACAA.CAACAATTGCATTCATT 
TTATGTTTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTAAAGCAAGTAAAACCTCTACAAA 
TGTGGTAAAATCGATAAGGATCCGTCGACCGATGCCCTTGAGAGCCTTCAACCCAGTCAGCT 
CCTTCCGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGACTGTCTTCTTTATCATG 
CAACTCGTAGGACAGGTGCCGGCAGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCT 
CGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACA 
GAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCG 
TAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAA 
ATCGACGCTCAAQTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCC 
CCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGC 
CTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGTAGGTATCTCAGTTCGG 
TGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGC 
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GeCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGC 
AGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGA 
AGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAG 
CCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAG 
CGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATC 
CTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTG 
GTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAA 
ATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGG 
CACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAG 
ATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCC 
ACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAA 
GTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTA 
AGTA6TTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTGTC 
ACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACAT 
GATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGT 
AAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCAT 
GCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGT 
GTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGC 
AGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTT 
ACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTT 
TTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGA 
ATAAGGGCGAGACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCAT 
TTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAA 
TAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGCGCCCTGTAGCGGCGCATTA 
AGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCC 
CGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTC 
TAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAA 
CTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTT 
GACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACC 
CTATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAA 
AATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGTTTACAATTTC 
CCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCTAT 
TACGCCAGCCCAAGCTACCATGATAAGTAAGTAATATTAAGGTACGGGAGGTACTTGGAGCG 
GCCGCTCTAGAACTAGTGGATCCCCCGGCCGCAATAAAATATCTTTATTTTCATTACATCTG 
TGTGTTGGTTTTTTGTGTGAATCGATAGTACTAACATACGCTCTCCATCAAAACAAAACGAA 
ACAAAACAAACTAGCAAAATAGGCTGTCCCCAGTGCAAGTGCAGGTGCCAGAACATTTCTCT 
ATCGATAGGTACCGAGCTCTTACGCGTGCTAGCCCTCGAGCAGGATCTATACATTGAATCAA 
TATTGGCAATTAGCCATATTAGTCATTGGTTATATAGCATAAATCAATATTGGCTATTGGCC 
ATTGCATACGTTGTATCTATATCATAATATGTACATTTATATTGGCTCATGTCCAATATGAC 
CGCCATGTTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTT 
CATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACC 
GCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAG 
GGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACAT 
CAAGTGTATCATATGCCAAGTCCGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTG 
GCATTATGCCCAGTACATGACCTTACGGGACTTTCCTACTTGGCAGTACATCTACGTATTAG 
TCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTT 
GACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCA 
AAATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTA 
GGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCGTTTAGTGAACCGTCAGATCGCCTGG 
AGACGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCCCTC 
GAAGCTCGACTCTAGGGGCTCGAGATCTGCGATCTAAGTAAGCTTGCATGCCTGCAGGTCGG 
CCGCCACGACCGGTGCCGCCACCATCCCCTGACGCACGCCCCTGACCCCTCACAAGGAGACG 
ACCTTCCATGACCGAGTACAAGCCCACGGTGCGCCTCGCCACCGGCGACGACGTCCCCCGGG 
CCGTACGCACCCTCGCCGCCGCGTTCGCCGACTACCCCGCCACGCGCCACACCGTCGACCCG 
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GACCGCCACATCGAGCGGGTCACCGAGCTGCAAGAACTCTTCCTCACGCGCGTCGGGCTCGA 
CATCGGCAAGGTGTGGGTCGCGGACGACGGCGCCGCGGTGGCGGTCTGGACCACGCCGGAGA 
GCGTCGAAGCGGGGGCGGTGTTCGCCGAGATCGGCCCGCGCATGGCCGAGTTGAGCGGTTCC 
CGGCTGGCCGCGCAGCAACAGATGGAAGGCCTCCTGGCGCCGCACCGGCCCAAGGAGCCCGC 
GTGGTTCCTGGCCACCGTCGGCGTCTCGCCCGACCACCAGGGCAAGGGTCTGGGCAGCGCCG 
TCGTGCTCCCCGGAGTGGAGGCGGCCGAGCGCGCCGGGGTGCCCGCCTTCCTGGAGACCTCC 
GCGCCCCGCAACCTCCCCTTCTACGAGCGGCTCGGCTTCACCGTCACCGCCGACGTCGAGGT 
GCCCGAAGGACCGCGCACCTGGTGCATGACCCGCAAGCCCGGTGCCTGACGCCCGCCCCACG 
ACCCGCAGCGCCCGACCGAAAGGAGCGCACGACCCCATGGCTCCGACCGAAGCCGACCCGGG 
CGGCCCCGCCGACCCCGCACCCGCCCCCGAGGCCCACCGACTCTAGAGTCGGGGCGGCCGGC 
CGCTTCGAGCAGACATGATAAGATACATTGATGAGTTTGGACAAACCACAACTAGAATGCAG 
TGAAAAAAATGCTTTATTTGTGAAATTTGTGATGCTATTGCTTTATTTGTAACCATTATAAG 
CTGCAATAAACT^GTTAACAAC^CaATTGCATTCATTTTATGTTTCAGGTTCAGGGGGAGG 
TGTGGGAGGTTTTTTAAAGCAAGTAAAACCTCTACAAATGTGGTAAAATCGATAAGGATCAA 
TTCGGCTTCAGGTACCGTCGACGATGTAGGTCACGGTCTCGAAGCCGCGGTGCGGGTGCCAG 
GGCGTGCCCTTGGGCTCCCCGGGCGCGTACTCCACCTCACCCATCTGGTCCATCATGATGAA 
CGGGTCGAGGTGGCGGTAGTTGATCCCGGCGAACGCGCGGCGCACCGGGAAGCCCTCGCCCT 
CGAAACCGCTGGGCGCGGTGGTCACGGTGAGCACGGGACGTGCGACGGCGTCGGCGGGTGCG 
GATACGCGGGGCAGCGTCAGCGGGTTCTCGACGGTCACGGCGGGCATGTCGACAGCCGAATT 
GATCCGTCGACCGATGCCCTTGAGAGCCTTCAACCCAGTCAGCTCCTTCCGGTGGGCGCGGG 
GCATGACTATCGTCGCCGCACTTATGACTGTCTTCTTTATCATGCAACTCGTAGGAGAGGTG 
CCGGCAGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGC 
GAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCA 
GGAAAGAACATG 
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pOM IFN-lns-CMV-pur-attB (SEQ ID NO: 8) 

GGCCGCCACCGCGGTGGAGCTCC7^TTCGCCCTATAGTGAGTCGTATTACAA.TTCACTGGCC 
GTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGC 
ACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAAC 
AGTTGCGCAGCCTGAATGGCGAATGGGACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGT 
GTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGC 
TTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGC 
TCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAA2^CTTGATTAGGGT 
GATGGTTCAGGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTC 
CACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCT 
ATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATT 
TAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGCTTACAATTTAGGTGGCACTTTT 
CGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCC 
GCTCATGAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAAA.GGAAGAGTATGAGTA 
TTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCT 
CACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTA 
CATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTC 
CAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGG 
CAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGT 
CACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCA 
TGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACC 
GCTTTTTTGCACAA.CATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAA 
TGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGC 
GCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATG 
GAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGC 
TGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATG 
GTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGA 
AATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGT 
TTACTCATATATACTTTAGATTGATTTAA?\ACTTCATTTTTAATTTAAAAGGATCTAGGTGA 
AGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCG 
TCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTG 
CTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTAC 
CAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTA 
GTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCT 
GCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACT 
CAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAG 
CCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAG 
CGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAG 
GAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTT 
CGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAA 
AAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGT 
TCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGAT 
ACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCG 
CCCAA.TACGCAAACCGCCTCTCCCC6CGCGTTGGCCGATTCATTAATGCAGCTGGCACGACA 
GGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTAGCTCACTCAT 
TAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAATTGTGAGCGG 
ATAACAATTTCACACAGGAAACAGCTATGACCATGATTACGCCAAGCTCGAAATTAACCCTC 
ACTAAAGGGAACAAAAGCTGGGTACCGGGCCCCCCCTCGACTAGAGGGACAGCCCCCCCCCA 
AAGCCCCCAGGGATGTAATTACGTCCCTCCCCCGCTAGGGGGCAGCAGCGAGCCGCCCGGGG 
CTCCGCTCCGGTCCGGCGCTCCCCCCGCATCCCCGAGCCGGCAGCGTGCGGGGACAGCCCGG 
GCACGGGGAAGGTGGCACGGGATCGCTTTCCTCTGAACGCTTCTCGCTGCTCTTTGAGCCTG 
CAGACACCTGGGGGGATACGGGGAAAAAGCTTTAGGCTGAAAGAGAGATTTAGAATGACAGA 
ATCATAGAACGGCCTGGGTTGCAAAGGAGCACAGTGCTCATCCAGATCCAACGCCCTGCTAT 
GTGCAGGGTCATCAACCAGCAGCCCAGGCTGCCCAGAGCCACATCCAGCCTGGCCTTGAATG 
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CCTGCAGGGATGGGGCATCCACAGCCTCCTTGGGCAACCTGTTCAGTGCGTCACCACCCTCT 
GGGGGAAAAACTGCCTCCTCATATCCAACCCAAACCTCCCCTGTCTCAGTGTAAAGCCATTC 
CCCCTTGTCCTATCAAGGGGGAGTTTGCTGTGACATTGTTGGTCTGGGGTGACACATGTTTG 
CCAATTCAGTGCATCACGGAGAGGCAGATCTTGGGGATAAGGAAGTGCAGGACAGCATGGAC 
GTGGGACATGCAGGTGTTGAGGGCTCTGGGACACTCTCCAAGTCACAGCGTTCAGAACAGCC 
TTAAGGATAAGAAGATAGGATAGAAGGACAAAGAGCAAGTTAAAACCCAGCATGGAGAGGAG 
CACAAAAAGGCCACAGACACTGCTGGTCCCTGTGTCTGAGCCTGCATGTTTGATGGTGTCTG 
GATGCAAGCAGAAGGGGTGGAAGAGCTTGCCTGGAGAGATACAGCTGGGTCAGTAGGACTGG 
GACAGGCAGCTGGAGAATTGCCATGTAGATGTTCATACAATCGTCAAATCATGAAGGCTGGA 
AAAGCCCTCCy^GATCCCCAAGACCAACCCCAACCCACCCACCGTGCCCACTGGCCATGTCC 
CTCAGTGCCACATCCCCACAGTTCTTCATCACCTCCAGGGACGGTGACCCCCCCACCTCCGT 
GGGCAGCTGTGCCACTGCAGCACCGCTCTTTGGAGAAGGTAAATCTTGCTAAATCCAGCCCG 
ACCCTCCCCTGGCACAACGTAAGGCCATTATCTCTCATCCAACTCCAGGACGGAGTCAGTGA 
GGATGGGGCTCTAGTCGAGGTCGACGGTATCGATAAGCTTGATTAGGCAGAGCAATAGGACT 
CTCAACCTCGTGAGTATGGCAGCATGTTAACTCTGCACTGGAGTCCAGCGTGGGAAACAATC 
TGCCTTGCACATGAGTCTTCGTGGGCCAATATTCCCCAACGGTTTTCCTTCAGCTTGTCTTG 
TCTCCTAAGCTCTCAAAACACCTTTTTGGTGAATAAACTCACTTGGCAACGTTTATCTGTCT 
TACCTTAGTGTCACGTTTCATCCCTATTCCCCTTTCTCCTCCTCCGTGTGGTACACAGTGGT 
GCACACTGGTTCTTCTGTTGATGTTCTGCTCTGACAGCCAATGTGGGTAAAGTTCTTCCTGC 
CACGTGTCTGTGTTGTTTTCACTTCAAAAAGGGCCCTGGGCTCCCCTTGGAGCTCTCAGGCA 
TTTCCTTAATCATCACAGTCACGCTGGCAGGATTAGTCCCTCCTAAACCTTAGAATGACCTG 
AACGTGTGCTCCCTCTTTGTAGTCAGTGCAGGGAGACGTTTGCCTCAAGATCAGGGTCCATC 
TCACCCACAGGGCCATTCCCAAGATGAGGTGGATGGTTTACTCTCACAAAAAGTTTTCTTAT 
GTTTGGCTAGAAAGGAGAACTCACTGCCTACCTGTGAATTCCCCTAGTCCTGGTTCTGCTGC 
CACTGCTGCCTGTGCAGCCTGTCCCATGGAGGGGGCAGCAACTGCTGTCACAAAGGTGATCC 
CACCCTGTCTCCACTGAAATGACCTCAGTGCCACGTGTTGTATAGGGTATAAAGTACGGGAG 
GGGGATGCCCGGCTCCCTTCAGGGTTGCAGAGCAGAAGTGTCTGTGTATAGAGTGTGTCTTA 
ATCTATTAATGTAACAGAACAACTTCAGTCCTAGTGTTTTGTGGGCTGGAATTGCCCATGTG 
GTAGGGACAGGCCTGCTAAATCACTGCAATCGCCTATGTTCTGAAGGTATTTGGGAAAGAAA 
GGGATTTGGGGGATTGCCTGTGATTGGCTTTAATTGAATGGCAAATCACAGGAAAGCAGTTC 
TGCTCAACAGTTGGTTGTTTCAGCCAATTCTTGCAGCCAAAGAGCCGGGTGCCCAGCGATAT 
AATAGTTGTCACTTGTGTCTGTATGGATGACAGGGAGGTAGGGTGACCTGAGGACCACCCTC 
CAGCTTCTGCTAGCGTAGGTACAGTCACCACCTCCAGCTCCACACGAGTCCCATCGTGGTTT 
ACCAAAGAAACACAATTATTTGGACCAGTTTGGAAAGTCACCCGCTGAATTGTGAGGCTAGA 
TTAATAGAGCTGAAGAGCAAATGTTCCCJVACTTGGAGATACTAGTTGGTATTAGTATCAGAG 
GAACAGGGCCATAGCACCTCCATGCTATTAGATTCCGGCTGGCATGTACTTTTCAAGATGAT 
TTGTAACTAACAATGGCTTATTGTGCTTGTCTTAAGTCTGTGTCCTAATGTAAATGTTCCTT 
TGGTTTATATAACCTTCTTGCCATTTGCTCTTCAGGTGTTCTTGCAGAACACTGGCTGCTTT 
AATCTAGTTTAACTGTTGCTTGATTATTCTTAGGGATAAGATCTGAATAAACTTTTTGTGGC 
TTTGGCAGACTTTAGCTTGGGCTTAGCTCCCACATTAGCTTTTGCTGCCTTTTGTGTGAAGC 
TATCAAGATCCTACTCAATGACATTAGCTGGGTGCAGGTGTACCAAATCCTGCTCTGTGGAA 
CACATTGTCTGATGATACGGAAGGCAAACGTGAACTCAAAGAGGGACAGAGTTAAGAAGAAG 
TCTGTGCAATTCAGAGGAAAAGCCAAAGTGGCCATTAGACACACTTTCCATGCAGCATTTGC 
CAGTAGGTTTCATATAAAACTACAAAATGGAATAAACCACTACAAATGGGAAAAGCCTGATA 
CTAGAATTTAAATATTCACCCAGGCTCAAGGGGTGTTTCATGGAGTAATATCACTCTATAAA 
AGTAGGGCAGCCAATTATTCACAGACAAAQCTTTTTTTTTTCTGTGCTGCAGTGCTGTTTTT 
CGGCTGATCCAGGGTTACTTATTGTGGGTCTGAGAGCTGAATGATTTCTCCTTGTGTCATGT 
TGGTGAAGGAGATATGGCCAGGGGGAGATGAGCATGTTCAAGAGGAAACGTTGCATTTTGGT 
GGCTTGGGAGAAAGGTAGAACGATATCAGGTCCATAGTGTCACTAAGAGATCTGAAGGATGG 
TTTTACAGAACAGTTGACTTGGCTGGGTGCAGGCTTGGCTGTAAATGGATGGAAGGATGGAC 
AGATGGGTGGACAGAGATTTCTGTGCAGGAGATCATCTCCTGAGCTCGGTGCTTGACAGACT 
GCAGATCCATCCCATAACCTTCTCCAGCATGAGAGCGCGGGGAGCTTTGGTACTGTTCAGTC 
TGCTGCTTGTTGCTTCCTGGGTGCACAGTGGTGATTTTCTTACTCACACAGGGCAAAAACCT 
GAGCAGCTTCAAAGTGAACAGGTTGCTCTCATAGGCCATTCAGTTGTCAAGATGAGGTTTTT 
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GGTTTCTTGTTTTGTAAGGTGGGAAGAAGCACTGAAGGATCAGTTGCGAGGGCAGGGGTTTA 

GCACTGTTCAGAGAAGTCTTATTTTAACTCCTCTCATGAACAAAAAGAGATGCAGGTGCAGA 

TTCTGGCAAGCATGCAGTGAAGGAGAAAGCCCTGAATTTCTGATATATGTGCAATGTTGGGC 

ACCTAACATTCCCCGCTGAAGCACAGCAGCTCCAGCTCCATGCAGTACTCACAGCTGGTGCA 

GCCCTCGGCTCCAGGGTCTGAGCAGTGCTGGGACTCACGAGGTTCCATGTCTTTCACACTGA 

TAATGGTCCAATTTCTGGAATGGGTGCCCATCCTTGGAGGTCCCCAAGGCCAGGCTGGCTGC 

GTCTCCGAGCAGCCCGATCTGGTGGTGAGTAGCCAGCCCATGGCAGGAGTTAGAGCCTGATG 

GTCTTTAAGGTCCCTTCCAACCTAAGCCATCCTACGATTCTAGGAATCATGACTTGTGAGTG 

TGTATTGCAGAGGCAATATTTTAAAGTTATAAATGTTTTCTCCCCTTCCTTGTTTGTCAAAG 

TTATCTTGATCGCCTTATCAATGCTTTTGGAGTCTCCAGTCATTTTTCTTACAMCAAAAAGA 

GGAGGAAGAATGAAGAGAATCATTTAATTTCTTGATTGAATAGTAGGATTCAGAAAGCTGTA 

CGTAATGCCGTCTCTTTGTATCGAGCTGTAAGGTTTCTCATCATTTATCAGCGTGGTACATA 

TCAGCACTTTTCCATCTGATGTGGAAAAAAAAATCCTTATCATCTACAGTCTCTGTACCTAA 

ACATCGCTCAGACTCTTTACCAAAAAAGCTATAGGTTTTAAAACTACATCTGCTGATAATTT 

GCCTTGTTTTAGCTCTTCTTCCATATGCTGCGTTTGTGAGAGGTGCGTGGATGGGCCTAAAC 

TCTCAGCTGCTGAGCTTGATGGGTGCTTAAGAATGAAGCACTCACTGCTGAAACTGTTTTCA 

TTTCACAGGAATGTTTTAGTGGCATTGTTTTTATAACTACATATTCCTCAGATAAATGAAAT 

CCTlGAAATAATTATGCy^AACTC^CTGCATCCGTTGCACAGGTCTTTATCTGCTAGCAAAGGA 

AATAATTTGGGGATGGCAAAAACATTCCTTCAGACATCTATATTTAAAGGAATATAATCCTG 

GTACCCACCCACTTCATCCCTCATTATGTTCACACTCAGAGATACTCATTCTCTTGTTGTTA 

TCATTTGATAGCGTTTTCTTTGGTTCTTTGCCACGCTCTGGGCTATGGCTGCACGCTCTGCA 

CTGATCAGCAAGTAGATGCGAGGGAAGCAGCAGTGAGAGGGGCTGCCCTCAGCTGGCACCCT^ 

GCCGCTCAGCCTAGGAGGGGACCTTGCCTTTCCACCAGCTGAGGTGCAGCCCTACAAGCTTA 

CACGTGCTGCGAGCAGGTGAGCAAAGGGAGTCTTCATGGTGTGTTTCTTGCTGCCCGGAAGC 

AAAACTTTACTTTCATTCATTCCCCTTGAAGAATGAGGAATGTTTGGAAACGGACTGCTTTA 

CGTTCAATTTCTCTCTTCCCTTTAAGGCTCAGCCAGGGGCCATTGCTGAGGACGGCATCGGG 

GCCCCCTGGACCAAATCTGTGGCACAGATGGTTTCACTTACATCAGTGGATGTGGGATCTGC 

GCCTGTAATGTGTCCTTCTGAAGGAAGGAACGTGCCTTCCAAGTGCCAGCCCCACAGCCCCC 

AGCCCCTCCCTGTGCTGCTCCAATTCATCTCCTCTTCCTCCTTCTCCCTTTGCTGTTTGTGC 

TCGGGTAGAAATCATGAAGATTTAGAAGAGAAAACAAAATAACTGGAGTGGAAACCCAGGTG 

ATGCAGTTCATTCAGCTGTCATAGGTTTGTCGTTGCTATAGGTCTGTATCAGAGATGCTARC 

ACCACTTTGCTGTCGGTGCTTAACTCGGGTGAACTCTCCTTCACTCGCATCATTTGCGGGCC 

TTATTTACATCCCCAGCATCCATCACCCTCTGGGAAAATGGGCGCACTGGATCTCTAATGGA 

AGACTTTCCCTCTTTCAGAGCCTGTGGGATGTGCAGTGACAAGAAACGTGGAGGGGCTGAGC 

AGCAGCACTGCCCCCAGGGAGCAGGAGCGGATGCCATGGGTGGCAGCATCCCAAATGATGTG 

AGCGGATGCTGAGCAGGCAGCGGACGAACGGACAGAAGCGATGCGTACACCTTCTGTTGACA 

TGGTATTTGGCAGCGATTTAACACTCGCTTCCTAGTCCTGCTATTCTCCACAGGCTGCATTC 

AAATGAACGAAGGGAAGGGAGGCAAAAAGATGCAAAATCCGAGACAAGCAGCAGAAATATTT 

CTTCGCTACGGAAGCGTGCGCAAACAACCTTCTCCAACAGCACCAGAAGAGCACAGCGTAAC 

CTTTTTCAAGACCAGAAAAGGAAATTCACAAAGCCTCTGTGGATACCAGCGCGTTCAGCTCT 

CCTGATAGCAGATTTCTTGTCAGGTTGCGAATGGGGTATGGTGCCAGGAGGTGCAGGGACCA 

TATGATCATATACAGCACAGCAGTCATTGTGCATGTATTAATATATATTGAGTAGCAGTGTT 

ACTTTGCCAAAGCAATAGTTCAGAGATGAGTCCTGCTGCATACCTCTATCTTAAAACTAACT 

TATAAATAGTAAAACCTTCTCAGTTCAGCCACGTGCTCCTCTCTGTCAGCACCAATGGTGCT 

TCGCCTGCACCCAGCTGCAAGGAATCAGCCCGTGATCTCATTAACACTCAGCTCTGCAGGAT 

AAATTAGATTGTTCCACTCTCTTTTGTTGTTAATTACGACGGAACAATTGTTCAGTGCTGAT 

GGTCCTAATTGTCAGCTACAGAAAACGTCTCCATGCAGTTCCTTCTGCGCCAGCAAACTGTC 

CAGGCTATAGCACCGTGATGCATGCTACCTCTCACTCCATCCTTCTTCTCTTTCCCACCAGG 

GAGAGCTGTGTGTTTTCACTCTCAGCCACTCTGAACAATACCAAACTGCTACGCACTGCCTC 

CGTCGGAAAGAGAATCCCCTTGTTGCTTTTTTATTTACAGGATCCTTCTTAAAAAGCAGACC 

ATCATTCACTQCAAACCCAGAGCTTCATGCCTCTCCTTCCACAACCGAAAACAGCCGGCTTC 

ATTTGTCTTTTTTAAATGCTGTTTTCCAGGTGAATTTTGGCCAGCGTGTTGGCTGAGATCCA 

GGAGCACGTGTCAGCTTTCTGCTCTCATTGCTCCTGTTCTGCATTGCCTCTTTCTGGGGTTT 

CCAAGAGGGGGGGAGACTTTGCGCGGGGATGAGATAATGCCCCTTTTCTTAGGGTGGCTGCT 
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GGGCAGCAGAGTGGCTCTGGGTCACTGTGGCACCAATGGGAGGCACCAGTGGGGGTGTGTTT 

TGTGCAGGGGGGAAGCATTCACAGAATGGGGCTGATCCTGAAGCTTGCAGTCCAAGGCTTTG 

TCTGTGTACCCAGTGAAATCCTTCCTCTGTTACATAAAGCCCAGATAGGACTCAGAAATGTA 

GTCATTCCAGCCCCCCTCTTCCTCAGATCTGGAGCAGCACTTGTTTGCAGCCAGTCCTCCCC 

AAAATGCACAGACCTCGCCGAGTGGAGGGAGATGTAAACAGCGAAGGTTAATTACCTCCTTG 

TCAAAAACACTTTGTGGTCCATAGATGTTTCTGTCAATCTTACAAAACAGAACCGAGAGGCA 

GCGAGCACTGAAGAGCGTGTTCCCATGCTGAGTTAATGAGACTTGGCAGCTCGCTGTGCAGA 

GATGATCCCTGTGCTTCATGGGAGGCTGTAACCTGTCTCCCCATCGCCTTCACACCGCAGTG 

CTGTCCTGGACACCTCACCCTCCATAAGCTGTAGGATGCAGCTGCCCAGGGATCAAGAGACT 

TTTCCTAAGGCTCTTAGGACTCATCTTTGCCGCTCAGTAGCGTGCAGCAATTACTCATCCCA 

ACTATACTGAATGGGTTTCTGCCAGCTCTGCTTGTTTGTCAATAAGCATTTCTTCATTTTGC 

CTCTAAGTTTCTCTCAGCAGCACCGCTCTGGGTGACCTGAGTGGCCACCTGGAACCCGAGGG 

GCACAGCCACCACCTCCCTGTTGCTGCTGCTCCAGGGACTCATGTGCTGCTGGATGGGGGGA 

AGCATGAAGTTCCTCACCCAGACACCTGGGTTGCAATGGCTGCAGCGTGCTCTTCTTGGTAT 

GCAGATTGTTTCCAGCCATTACTTGTAGAAATGTGCTGTGGAAGCCCTTTGTATCTCTTTCT 

GTGGCCCTTCAGCAAAAGCTGTGGGAAAGCTCTGAGGCTGCTTTCTTGGGTCGTGGAGGAAT 

TGTATGTTCCTTCTTTAACAAAAATTATCCTTAGGAGAGAGCACTGTGGAAGCATTGTGCAC 

ATAAAACAATTCAGGTTGAAAGGGCTCTCTGGAGGTTTCCAGCCTGACTACTGCTCGAAGCA 

AGGCCAGGTTCAAAGATGGCTCAGGATGCTGTGTGCCTTCCTGATTATCTGTGCCACCAATG 

GAGGAGATTCACAGCCACTCTGCTTCCCGTGCCACTCATGGAGAGGAATATTCCCTTATATT 

CAGATAGAATGTTATCCTTTAGCTCAGCCTTCCCTATAACCCCATGAGGGAGCTGCAGATCC 

CCATACTCTCCCCTTCTCTGGGGTGAAGGCCGTGTCCCCCAGCCCCCCTTCCCACCCTGTGC 

CCTAAGCAGCCCGCTGGCCTCTGCTGGATGTGTGCCTATATGTCAATGCCTGTCCTTGCAGT 

CCAGCCTGGGACATTTAATTCATCACCAGGGTAATGTGGAACTGTGTCATCTTCCCCTGCAG 

GGTACAAAGTTCTGCACGGGGTCCTTTCGGTTCAGGAAAACCTTCACTGGTGCTACCTGAAT 

CAAGCTCTATTTAATAAGTTCATAAGCACATQGATGTGTTTTCCTAGAGATACGTTTTAATG 

GTATCAGTGATTTTTATTTGCTTTGTTGCTTACTTCAAACAGTGCCTTTGGGCAGGAGGTGA 

GGGACGGGTCTGCCGTTGGCTCTGCAGTGATTTCTCCAGGCGTGTGGCTCAGGTCAGATAGT 

GGTCACTCTGTGGCCAGAAGAAGGACAAAGATGGAAATTGCAGATTGAGTCACGTTAAGCAG 

GCATCTTGGAGTGATTTGAGGCAGTTTCATGAAAGAGCTACGACCACTTATTGTTGTTTTCC 

CCTTTTACAACAGAAGTTTTCATCAAAATAACGTGGCAAAGCCCAGGAATGTTTGGGAAAAG 

TGTAGTTAAATGTTTTGTAATTCATTTGTCGGAGTGCTACCA6CTAAGAAAAAAGTCCTACC 

TTTGGTATGGTAGTCCTGCAGAGAATACAACATCAATATTAGTTTGGAAAAAAACACCACCA 

CCACCAGAAACTGTAATGGAAAATGTAAACCAAGAAATTCCTTGGGTAAGAGAGAAAGGATG 

TCGTATACTGGCCAAGTCCTGCCCAGCTGTCAGCCTGCTGAGCCTCTGCAGTTCAGGACCAT 

GAAACGTGGCACTGTAAGACGTGTCCCCTGCCTTTGCTTGCCCACAGATCTCTGCCCTTGTG 

CTGACTCCTGCACACAAGAGCATTTCCCTGTAGCCAAACAGCGATTAGCCATAAGCTGCACC 

TGACTTTGAGGATTAAGAGTTTGCAATTAAGTGGATTGCAGCAGGAGATCAGTGGCAGGGTT 

GCAGATGAAATCCTTTTCTAGGGGTAGCTAAGGGCTGAGCAACCTGTCCTACAGCACAAGCC 

AAACCAGCCAAGGGTTTTCCTGTGCTGTTCACAGAGGCAGGGCCAGCTGGAGCTGGAGGAGG 

TTGTGCTGGGACCCTTCTCCCTGTGCTGAGAATGGAGTGATTTCTGGGTGCTGTTCCTGTGG 

CTTGCACTGAGCAGCTCAAGGGAGATCGGTGCTCCTCATGCAGTGCCAAAACTCGTGTTTGA 

TGCAGAAAGATGGATGTGCACCTCCCTCCTGCTAATGCAGCCGTGAGCTTATGAAGGCAATG 

AGCCCTCAGTGCAGCAGGAGCTGTAGTGCACTCCTGTAGGTGCTAGGGAAAATCTCTGGTTC 

CCAGGGATGCATTCATAAGGGCAATATATCTTGAGGCTGCGCCAAATCTTTCTGAAATATTC 

ATGCGTGTTCCCTTAATTTATAGAAACAAACACAGCAGAATAATTATTCCAATGCCTCCCCT 

CGAAGGAAACCCATATTTCCATGTAGAAATGTAACCTATATACACACAGCCATGCTGCATCC 

TTCAGAACGTGCCAGTGCTCATCTCCCATGGCAAAATACTACAGGTATTCTCACTATGTTGG 

ACCTGTGAAAGGAACCATGGTAAGAAACTTCGGTTAAAGGTATGGCTGCAAAACTACTCATA 

CCAAAACAGCAGAGCTCCAGACCTCCTCTTAGGAAAGAGCCACTTGGAGAGGGATGGTGTGA 

AGGCTGGAGGTGAGAGACAGAGCCTGTCCCAGTTTTCCTGTCTCTATTTTCTGAAACGTTTG 

CAGGAGGAAAGGACAACTGTACTTTCAGGCATAGCTGGTGCCCTCACGTAAATAAGTTCCCC 

GAACTTCTGTGTCATTTGTTCTTAAGATGCTTTGGCAGAACACTTTGAGTCAATTCGCTTAA 

CTGTGACTAGGTCTGTAAATAAGTGCTCCCTGCTGATAAGGTTCAAGTGACATTTTTAGTGG 
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TATTTGACAGCATTTACCTTGCTTTCAAGTCTTCTACCAAGCTCTTCTATACTTAAGCAGTG 

AAACCGCCAAGAAACCCTTCCTTTTATCAAGCTAGTGCTAAATACCATTAACTTCATAGGTT 

AGATACGGTGCTGCCAGCTTCACCTGGCAGTGGTTGGTCAGTTCTGCTGGTGACAAAGCCTC 

CCTGGCCTGTGCTTTTACCTAGAGGTGAATATCCAAGAATGCAGAACTGCATGGAAAGCAGA 

GCTGCAGGCACGATGGTGCTGAGCCTTAGCTGCTTCCTGCTGGGAGATGTGGATGCAGAGAC 

GAATGAAGGACCTGTCCCTTACTCCCCTCAGCATTCTGTGCTATTTAGGGTTCTACCAGAGT 

CCTTAAGAGGTTTTTTTTTTTTTTGGTCCAAAAGTCTGTTTGTTTGGTTTTGACCACTGAGA 

GCATGTGACACTTGTCTCAAGCTATTAACCAAGTGTCCAGCCAAAATCAATTGCCTGGGAGA 

CGCAGACCATTACCTGGAGGTCAGGACCTCAATAAATATTACCAGCCTCATTGTGCCGCTGA 

CAGATTCAGCTGGCTGCTCCGTGTTCCAGTCCAACAGTTCGGACGCCACGTTTGTATATATT 

TGCAGGCAGCCTCGGGGGGACCATCTCAGGAGCAGAGCACCGGCAGCCGCCTGCAGAGCCGG 

GCAGTACCTCACCATGGCTTTGACCTTTGCCTTACTGGTGGCTCTCCTGGTGCTGAGCTGCA 

AGAGCAGCTGCTCTGTGGGCTGCGATCTGCCTCAGACCCACAGCCTGGGCAGCAGGAGGACC 

CTGATGCTGCTGGCTCAGATGAGGAGAATCAGCCTGTTTAGCTGGCTGAAGGATAGGCACGA 

TTTTGGCTTTCCTCAAGAGGAGTTTGGCAACCAGTTTCAGAAGGCTGAGACCATCCCTGTGC 

TGCACGAGATGATCCAGCAGATCTTTAACCTGTTTAGCACCAAGGATAGCAGCGCTGCTTGG 

GATGAGACCCTGCTGGATAAGTTTTACACCGAGCTGTACCAGCAGCTGAACGATCTGGAGGC 

TTGCGTGATCCAGGGCGTGGGCGTGACCGAGACCCCTCTGATGAAGGAGGATAGCATCCTGG 

CTGTGAGGAAGTACTTTCAGAGGATCACCCTGTACCTGAAGGAGAAGAAGTACAGCCCCTGC 

GCTTGGGAAGTCGTGAGGGCTGAGATCATGAGGAGCTTTAGCCTGAGCACCAACCTGCAAGA 

GAGCTTGAGGTCTAAGGAGTAAAAAGTCTAGAGTCGGGGCGGCCGGCCGCTTCGAGCAGACA 

TGATAAGATACATTGATGAGTTTGGACAAACCACAACTAGAATGCAGTGAAAAAAATGCTTT 

ATTTGTGAAATTTGTGATGCTATTGCTTTATTTGTAACCATTATAAGCTGCAATAAACAAGT 

TAACAACAACAATTGCATTCATTTTATGTTTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTT 

AAAGCAAGTAAAACCTCTACAAATGTGGTAAAATCGATACCGTCGACCTCGACTAGAGCGGC 

CACTAACATACGCTCTCCATCAAAACAAAACGAAACAAAACAAACTAGCAAAATAGGCTGTC 

CCCAGTGCAAGTGCAGGTGCCAGAACATTTCTCTATCGATAGGTACCGAGCTCTTACGCGTG 

CTAGCCCTCGAGCAGGATCTATACATTGAATCAATATTGGCAATTAGCCATATTAGTCATTG 

GTTATATAGCATAAATCAATATTGGCTATTGGCCATTGCATACGTTGTATCTATATCATAAT 

ATGTACATTTATATTGGCTCATGTCCAATATGACCGCCATGTTGACATTGATTATTGACTAG 

TTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTA 

CATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCA 

ATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGA 

GTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTCCGCCCC 

CTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTACGG 

GACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTT 

TTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACC 

CCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGT 

AACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAG 

CAGAGCTCGTTTAGTGAACCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTGACCTCC 

ATAGAAGACACCGGGACCGATCCAGCCTCCCCTCGAAGCTCGACTCTAGGGGCTCGAGATCT 

GCGATCTAAGTAAGCTTGCATGCCTGCAGGTCGGCCGCCACGACCGGTGCCGCCACCATCCC 

CTGACCCACGCCCCTGACCCCTCACAAGGAGACGACCTTCCATGACCGAGTACAAGCCCACG 

GTGCGCGTCGCCACCCGCGACGACGTCCCCCGGGCCGTACGCACCCTCGCCGCCGCGTTCGC 

CGACTACCCCGCCACGCGCCACACCGTCGACCCGGACCGCCACATCGAGCGGGTCACCGAGC 

TGCAAGAACTCTTCCTCACGC6CGTCGGGCTCGACATCGGCAAGGTGTGGGTCGCGGACGAC 

GGCGCCGCGGTGGCGGTCTGGACCACGCCGGAGAGCGTCGAAGCGGGGGCGGTGTTCGCCGA 

GATCGGCCCGCGCATGGCCGAGTTGAGCGGTTCCCGGCTGGCCGCGCAGCAACAGATGGAAG 

GCCTCCT6GCGCCGCACCGGCCCAAGGAGCCCGCGTGGTTCCTGGCCACCGTCGGCGTCTCG 

CCCGACCACCAGGGCAAGGGTCTGGGCAGCGCCGTCGTGCTCCCCGGAGTGGAGGCGGCCGA 

GCGCGCCGGGGTGCCCGCCTTCCTGGAGACCTCCGCGCCCCGCAACCTCCCCTTCTACGAGC 

GGCTCGGCTTCACCGTCACCGCCGACGTCGAGGTGCCCGAAGGACCGCGCACCTGGTGCATG 

ACCCGCAAGCCCGGTGCCTGACGCCCGCCCCACGACCCGCAGCGCCCGACCGAAAGGAGCGC 

ACGACCCCATGGCTCCGACCGAAGCCGACCCGGGCGGCCCCGCCGACCCCGCACCCGCCCCC 
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GAGGCCCACCGACTCTAGAGTCGGGGCGGCCGGCCGCTTCGAGCAGACATGATAAGATACAT 
TGATGAGTTTGGACAAACCACAACTAGAATGCAGTGAAAAAAATGCTTTATTTGTGAAATTT 
GTGATGCTATTGCTTTATTTGTAACCATTATAAGCTGCAATAAACAAGTTAACAACAAGAAT 
TGCATTCATTTTATGTTTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTAAAGCAAGTAAAA 
CCTCTACAAATGTGGTAAAATCGATAAGGATCAATTCGGCTTCAGGTACCGTCGACGATGTA 
GGTCACGGTCTCGAAGCCGCGGTGCGGGTGCCAGGGCGTGCCCTTGGGCTCCCCGGGCGCGT 
ACTCCACCTCACCCATCTGGTCCATCATGATGAACGGGTCGAGGTGGCGGTAGTTGATCCCG 
GCGAACGCGCGGCGCACCGGGAAGCCCTCGCCCTCGAAACCGCTGGGCGCGGTGGTCACGGT 
GAGCACGGGACGTGCGACGGCGTCGGCGGGTGCGGATACGCGGGGCAGCGTCAGCGGGTTCT 
CGACGGTCACGGCGGGCATGTCGACAGCCGAATTGATCCGTCGACCGATGCCCTTGAGAGCC 
TTCAACCCAGTCAGCTCCTTCCGGTaGGCGCGGGGCATGACTATCGTCGCCGCACTTATGAC 
TGTCTTCTTTATCATGCAAGTGGTAGGACAGGTGCCGGCAGC 
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pRSV-C31int (SEQ ID NO: 9) 

CTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCC 
GCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCT 
CACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATG 
TGAGCAAAAGGCa^GCyUU^GGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTC 
CATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGA 
AACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCT 
CCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTG 
GCGCTTTCTCAATGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAG 
CTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTAT 
CGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAAC 
AGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAAC 
TACGGCTACACTAGAA6GACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTC 
GGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTT 
TTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATC 
TTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATG 
AGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCA 
ATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCA 
CCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAG 
ATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGAC 
CCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGC 
AGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCT 
AGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATC 
GTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGG 
CGAGTTACATGATCCCCGATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATC 
GTTGTCAGAAGTAAGTTGGCCGCAGTGTTAT CAC T CATGGTTATGGCAGCACTGCATAAT 
TCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAG 
TCATTCTGAGAATAGTGTATGCGGCGACCGAGTT6CTCTTGCCCGGCGTCAATACGGGAT 
AATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGG 
CGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCA 
CCCaACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGA 
AGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTC 
TTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATA 
TTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTG 
CCACCTGACGTCGACGGATCGGGAGATCTCCCGATCCCCTATGGTCGACTCTCAGTACAA 
TCTGCTCTGATGCCGCATAGTTAAGCCAGTATCTGCTCCCTGCTTGTGTGTTGGAGGTCG 
CTGA6TAGTGCGCGAGCAAAATTTAAGCTACAACAAGGCAAGGCTTGACCGACAATTGCA 
TGAAGAATCTGCTTAGGGTTAGGCGTTTTGCGCTGCTTCGCGATGTACGGGCCAGATATA 
CGCGTGCTAGGGGTCTAGGATCGATTCTAGGAATTCTCTAGCCGCGGTCTAGGGATCCCG 
GCGCGTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGTATCT 
GCTCCCTGCTTGTGTGTTGGAGGTCGCTGAGTAGTGCGCGAGCAAAATTTAAGCTACAAC 
AAGGCAAGGCTTGACCGACAATTGCATGAAGAATCTGCTTAGGGTTAGGCGTTTTGCGCT 
GCTTCGCGATGTACGGGCCAGATATACGCGTATCTGAGGGGACTAGGGTGTGTTTAGGCG 
AAAAGCGGGGCTTCGGTTGTACGCGGTTAGGAGTCCCCTCAGGATATAGTAGTTTCGCTT 
TTGCATAGGGAGGGGGAAATGTAGTCTTATGCAATACACTTGTAGTCTTGCAACATGGTA 
ACGATGAGTTAGCAACATGCCTTACAAGGAGAGAAAAAGCACCGTGCATGCCGATTGGTG 
GAAGTAAGGTGGTACGATCGTGCCTTATTAGGAAGGCAACAGACAGGTCTGACATGGATT 
GGACGAACCACTGAATTCCGCATTGCAGAGATAATTGTATTTAAGTGCCTAGCTCGATAC 
AATAAACGCCATTTGACCATTCACCACATTGGTGTGCACCTCCAAGCTTGCATGCCTGCA 
GGTACCGGTCCGGAATTCCCGGGTCGACGAGCTCACTAGTCGTAGGGTCGCCGACATGAC 
ACAAGG6GTTGTGACCGG6GTGGACACGTACGC6GGTGCTTACGACCGTCAGTCGCGCGA 
GCGCGAGAATTCGAGCGCAGCAAGCCCAGCGACACAGCGTAGCGCCAACGAAGACAAGGC 
GGCCGACCTTCAGCGCGAAGTCGAGCGCGACGGGGGCCGGTTCAGGTTCGTCGGGCATTT 
CAGCGAAGCGCCGGGCACGTCGGCGTTCGGGACGGCGGAGCGCCCGGAGTTCGAACGCAT 
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CCTGAACGAATGCCGCGCCGGGCGGCTCTy^CATGATCATTGTCTATGACGTGTCGCGCTT 
CTCGCGCCTGAAGGTCATGGACGCGATTCCGATTGTCTCGGAATTGCTCGCCCTGGGCGT 
GACGATTGTTTCCACTCAGGAAGGCGTCTTCCGGCAGGGAAACGTCATGGACCTGATTCA 
CCTGATTATGCGGCTCGACGCGTCGCACAAAGAATCTTCGCTGAAGTCGGCGAAGATTCT 
CGACACGAAGAACCTTCAGCGCGAATTGGGCGGGTACGTCGGCGGGAAGGCGCCTTACGG 
CTTCGAGCTTGTTTCGGAGACGAAGGAGATCACGCGCAACGGCCGAATGGTCAATGTCGT 
CATCAACAAGCTTGCGCACTCGACCACTCCCCTTACCGGACCCTTCGAGTTCGAGCCCGA 
CGTAATCCGGTGGTGGTGGCGTGAGATCAAGACGCACAAACACCTTCCCTTCAAGCCGGG 
CAGTCAAGCCGCCATTCACCCGGGCAGCATCACGGGGCTTTGTAAGCGCATGGACGCTGA 
CGCCGTGCCGACCCGGGGCGAGACGATT6GGAAGAAGACCGCTTCAAGCGCCTGGGACCC 
GGCAACCGTTATGCGAATCCTTCGGGACCCGCGTATTGCGGGCTTCGCCGCTGAGGT6AT 
CTACAAGAAGAAGCCGGACGGCACGCCGACCACGAAGATTGAGGGTTACCGCATTCAGCG 
CGACCCGATCACGCTCCGGCCGGTCGAGCTTGATTGCGGACCGATCATCGAGCCCGCTGA 
GTGGTATGAGCTTCAGGCGTGGTTGGACGGCAGGGGGCGCGGCAAGGGGCTTTCCCGGGG 
GCAAGCCATTCTGTCCGCCATGGACAAGCTGTACTGCGAGTGTGGCGCCGTCATGACTTC 
GAAGCGCGGGGAAGAATCGATCAAGGACTCTTACCGCTGCCGTCGCCGGAAGGTGGTCGA 
CCCGTCCGCACCTGGGCAGCACGAAGGCACGTGCAACGTCAGCATGGCGGCACTCGACAA 
GTTCGTTGCGGAACGCATCTTCAACAAGATCAGGCACGCCGAAGGC6ACGAAGAGACGTT 
GGCGCTTCTGTGGGAAGCCGCCCGACGCTTCGGCAAGCTCACTGAGGCGCCTGAGAAGAG 
CGGCGAACGGGCGAACCTTGTTGCGGAGCGCGCCGACGCCCTGAACGCCCTTGAAGAGCT 
GTACGAAGACCGCGCGGCAGGCGCGTACGACGGACCCGTTGGCAGGAAGCACTTCCGGAA 
GCAACAGGCAGCGCTGACGCTCCGGCAGCAAGGGGCGGAAGAGCGGCTTGCCGAACTTGA 
AGCCGCCGAAGCCCCGAAGCTTCCCCTTGACCAATGGTTCCCCGAAGACGCCGACGCTGA 
CCCGACCGGCCCTAAGTCGTGGTGGGGGCGCGCGTCAGTAGACGACAAGCGCGTGTTCGT 
CGGGCTCTTCGTAGACAAGATCGTTGTCACGAAGTCGACTACGGGCAGGGGGCAGGGAAC 
GCCCATCGAGAAGCGCGCTTCGATCACGTGGGCGAAGCCGCCGACCGACGACGACGAAGA 
CGACGCCCAGGACGGCACGGAAGACGTAGCGGCGTAGCGAGACACCCGGATCCCTCGAGG 
GGCCCTATTCTATAGTGTCACCTAAATGCTAGAGCTCGCTGATCAGC CTCGACTGTGC CT 
TCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCCCGTGCCTTCCTTGACCCTGGAAGGT 
GCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAAATTGCATCGCATTGTCTGAGTAGG 
TGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGGACAGCAAGGGGGAGGATTGGGAAGAC 
AATAGCAGGCATGCTGGGGATGCGGTGGGCTCTATGGCTTCTGAGGCGGAAAGAACCAGG 
TGCCCAGTCATAGCCGAATAGCCTCTCCACCCAAGCGGCCGGAGAACCTGCGTGCAATCC 
ACTGGGG6CGCG 
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AGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGC 

ACGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTAGC 

TCACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAA 

TTGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACCATGATTACGCCAAGCTAT 

TTAGGT6AC6CGTTAGAATACTCAAGCTATGCATCAAGCTTGGTACCGAGCTCGGATCCA 

CTAGTAACGGCCGCO^GTGTGCTGGAATTCGCCCTTGGCCGCyU^TAAAATATCTTTATTT 

TCATTACATCTGTGTGTTGGTTTTTTGTGTGAATCGATAGTACTAACATACGCTCTCCAT 

CAAAACAAAACGAAACAAAACAAACTAGCAAAATAGGCTGTCCCCAGTGCAAGTGCAGGT 

GCCAGAACATTTCTCTATCGATAGGTACC6AGCTCTTACGCGTGCTAGCCCTCGAGCAGG 

ATCTATACa.TTGAATa^TATTGGCfATTAGCCATATTAGTCATTGGTTATATAGCATAA 

ATOATATTGGCTATTGGCCATTGCATACGTTGTATCTATATCSITAATATGTACATTTAT 

ATTGGCTCATGTCCAATATGACCGCCATGTTGACATTGATTATTGACTAGTTATTAATAG 

TAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGC6TTACATAACTT 

ACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATG 

ACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAGTAT 

TTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTCCGCCCCCT 

ATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTACGG 

GACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGG 

TTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTC 

CACCCCATTGACGTCaATGGGAGTTTGTTTTGGCACCSU^AATCAACGGGACTTTCCA^ 

TGTCGTAACAACTCCGCCXICATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTC 

TATATAAGCAGAGCTCGTTTAGTGAACCGTCAGATCGCCTGGAGACGCCATCCACGCTGT 

TTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCCCTCGAAGCTCGACTCTAGG 

GGCTCGAGATCTGCGATCTAA6TAAGCTTGCATGCCTGCAGGTCGGCCGCCACGACCGGT 

GCCGCCACCATCCCCTGACCCACGCCCCTGACCCCTCACAAGGAGACGACCTTCCATGAC 

CX3AGTACAAGCCCACGGTGCGCCTCGCCACCCGCGACGACGTCCCCCGGGCCGTACGCAC 

CCTCGCCGCCGCGTTCGCCGACTACCCCX3CCACGCGCCACACCGTCGACCCGGACCGCCA 

a^TCGAGCGGGTO^CaSAGCTGCAAGAACTCTTCCTCACGCGCGTCGGGCTCGACATCGG 

CAAGGTGTGGGTCGCGGACGACGGCGCCGCGGTGGCGGTCTGGACCACGCCGGAGAGCGT 

CGAAGCGGGGGCGGTGTTCGCCGAGATCGGCCCGCGCATGGCCGAGTTGAGCGGTTCCCG 

GCTGGCCGCGCAGCAACAGATGGAAGGCCTCCTGGCGCCGCACCGGCCCAAGGAGCCCGC 

GTGGTTCCTGGCCACCGTCGGCGTCTCGCCCGACCACCAGGGCAAGGGTCTGGGCAGCGC 

CGTCX3TGCTCCCCGGAGTGGAGGCGGCCGA6CX3CX3CCX3GGGTGCCCGCCTTCCTGGAGAC 

CTCCGCGCCCCGCAACCTCCCCTTCTACGAGCGGCTCGGCTTCACCGTCACCGCC6ACGT 

CGAGGTGCCCGAAGGACCGCGCACCTGGTGCATGACCCGCAAGCCCGGTGCCTGACGCCC 

GCCCCACGACCCGCAGCGCCCGACCGAAAGGAGCGCACGACCCCATGGCTCCGACCGAAG 

CCGACCCGGGCGGCCCCGCCGACCCCGCACCCGCCCCCGAGGCCCACCGACTCTAGAGTC 

GGGGaMCCGGCCGCTTaSAGOVGAO^TGATAAGATACATteATGAGTTTGGACAAACCA 

CAACTAGAATGCAGTGAAAAAAATGCTTTATTTGTGAAATTTGTGATGCTATTGCTTTAT 

TTGTAACCATTATAAGCTGOiATAAACAAGTTAACaACAACAATTGCATTCUVTT^ 

TTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTAAAGCAAGTAAAACCTCTACAAATGTG 

GTAAAATCGATAAGGATCAATTCGGCTTCAGGTACCGTCGACGATGTAGGTCACGGTCTC 

GAAGCCGOGGTGOSGGTGCCAGGGCGTGCCCTTGGQCTCCCCXSGGCGCGTACTCCACCTC 

ACCCATCTGGTCCATCATGATGAACGGGTCGAGGTGGCGGTAGTTGATCCCGGCGAACGC 

GCGGCGCACCGGGAAGCCCTCGCCCTCGAAACCGCTGGGCGCGGTGGTCACGGTGAGCAC 

GGGACGTGCGACGGCGTCGGCGGGTGCGGATACGCGGGGCAGCGTCAGCGGGTTCTCGAC 

GGTCACGGCGGGCATGTCGACAGCCGAATTGATCCGTCGACCGATGCCCTTGAGAGCCTT 

CAACCCAGTCAGCTCCTTCCGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGAC 

TGTCTTCTTTATCATGCAACTCGTAGGACAGGTGCCGGCAGCGCTCTTCCGCTTCCTCGC 

TCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGG 

CGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGAAGGGCGAAT 

TCTGCAGATATCCaVTCACACTGGCGGCCGCTCGAGCATGCATCTAGAGGGCCCAATTCGC 

CCTATAGTGAGTCGTATTACAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAA 

ACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTA 

ATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTATACGTACGGC 

AGTTTAAGGTTTACACCTATAAAAGAGAGAGCCGTTATCGTCTGTTTGTGGATGTACAGA 

GTGATATTATTGACACGCCGGGGCGACGGATGGTGATCCCCCTGGCCAGTGCACGTCTGC 
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TGTCAGATAAAGTCTCCCGTGAA.CTTTACCCGGTGGTGCATATCGGGGATGAAAGCTGGC 

GCATGATGACCACCGATATGGCCAGTGTGCCGGTCTCCGTTATCGGGGAAGAAGTGGCTG 

ATCTCAGC(:a^CCGCGAAAATGACATCAAA?yVCGCCATTAACCTGATGTTCTGGGGAATAT 

AAATGTCAGGCATGAGATTATCaAAAAGGATCTTa^CCTAGATCCTTTTCACGTAGAAAG 

CCAGTCCGCAGAAACGGTGCTGACCCCGGATGAATGTCAGCTACTGGGCTATCTGGACAA 

GGGAAAACGCAAGCGCAAAGAGAAAGCAGGTAGCTTGCAGTGGGCTTACATGGCGATAGC 

TAGACTGGGCGGTTTTATGGACAGCAAGCGAACCGGAATTGCCAGCTGGGGCGCCCTCTG 

GTAAGGTTGGGAAGCCCTGCAAAGTAAACTGGATGGCTTTCTCGCCGCCAAGGATCTGAT 

GGCGaVGGGGATaUVGCTCTGATC7UM3AGACAGGATGAGGATCGTTTCGCATGATTGAAC 

AA6ATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGACT 

GGGCACAACAGACAATCGGCTGCTCTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGC 

GCCCGGTTCTTTTTGTCAAGACCGACCTGTCCGGTGCCCTGAATGAACTGCAAGACGAGG 

CAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTG 

TCACTGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGT 

Ca^TCTCACCTTGCTCCTGCCGAGAAAGTATCGATCATGGCTCATGCAATGCGGC^ 

ATACGCTTGATCCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAG 

CACGTACTCGGATGGAAGCCGGTCTT6TCGATCAGGATGATCTGGACGAAGAGCATCAGG 

GGCTCGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGAGCATGCCCGACGGCGAGGATC 

TCGTCGTGACCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTT 

CTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTATCAGGACATAGCGTTGG 

CTACCCGTGATATTGCTGAAGA6CTTGGCGGC6AATGGGCTGACCX3CTTCCTCGTGCTTT 

ACGGTATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATC6CCTTCTTGACGAGTTCT 

TCTGAATTATTAACGCTTACAATTTCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCG 

GTATTTCACACCGCATACAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTG 

TTTATTTTTCTAAATACATTCAAATAT6TATCCGCTCATGAGACAATAACCCTGATAAAT 

GCTTCy^TAATAGOVCGTGAGGAGGGCCa^CCaVTGGCa^GTTGACaWSTGCCXSTTCCGG 

GCTCACCGCGCGCGACGTCGCCGGAGCGGTCGAGTTCTG6ACCGACCGGCTCX3GGTTCTC 

CCGGGACTTCGTGGAGGACGACTTCGCCGGTGTGGTCCGGGACGACGT6ACCCTGTTCAT 

CAGCGCGGTCCAGGACCAGGTGGTGCCGGACAACACCCTGGCCTGGGTGTGGGTGCGCGG 

CCTGGACGAGCTGTACGCCGAGTGGTCGGAGGTCGTGTCCACX3AACTTCCGGGACGCCTC 

CGGGCCGGCa^TGACCGAGATOSGCGAGCAGCCGTGGGGGCGGGAGTTCGCCCTGCGCGA 

CCCGGCCGGO^CTGCGTGCACrTCGTGGCCGAGGAGCAGGACTGAa^CGTGCrAAAACT 

TCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAAT 

CCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATC 

TTCTTGAGATCCTTTTTTTCTGCGCGTAATCT6CTGCTTGCAAACA2^AAAAACCACCGCT 

ACCAGCGGTGGTTTGTTTGCCGGATOUWSAGCTACCAACTCTTTTTCCGAAGGTAACTGG 

CTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCA 

CTTCS^GAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGC 

TGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGA 

TAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAAC 

GACCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGA 

AGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAG 

GGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTG 

ACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAG 

CAACGCGGCCTTTTTACGGTTCCTGGGCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCC 

TGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGC 

TCGCCGCAGCCGiWVCGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAG 



FIG. 18 
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SEQ ID NO: 11 

GACTAGTACTGACGGACACACCGAAGCCCCGGCGGCAACCCTCAGCGGATGCCCCGGGGCTT 
CACGTTTTCCCAGGTCAGAAGCGGTTTTCGGGAGTAGTGCCCCAACTGGGGTAACCTTTGAG 
TTCTCTCAGTTGGGGGCGTAGGGTCGCCGACATGACACAAGGGGTTGTGACCGGGGTGGACA 
CGTACGCGGGTGCTTACGACCGTCAGTCGCGCGAGCGCGACTAGTACA 



Fig. 19 
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SEQUENCE LISTING 

<110> AviGenics, Inc 

<120> Avian integrase-mediated Transformation 
<130> A181 1080.1 
<160> 12 

<170> Patentin version 3.2 

<210> 1 
<211> 6230 
<212> DNA 

<213> Plasmid pCMV-31int 
<400> 1 

cattcgccat tcaggctgcg caactgttgg gaagggcgat cggtgcgggc ctcttcgcta 60 

ttacgccagc caatacgcaa accgcctctc cccgcgcgtt ggccgattca ttaatgcagg 120 

atcgatccag acatgataag atacattgat gagtttggac aaaccacaac tagaatgcag 180 

tgaaaaaaat gctttatttg tgaaatttgt gatgctattg ctttatttgt aaccattata 240 

agctgcaata aacaagttaa caacaacaat tgcattcatt ttatgtttca ggttcagggg 300 

gaggtgtggg aggtttttta aagcaagtaa aacctctaca aatgtggtat ggctgattat 360 

gatcatgaac agactgtgag gactgagggg cctgaaatga gccttgggac tgtgaatcta 420 

aaatacacaa acaattagaa tcactagctc ctgtgtataa tattttcata aatcatactc 480 

agtaagcaaa actctcaagc agcaagcata tgcagctagt ttaacacatt atacacttaa 540 

aaattttata tttaccttag agctttaaat ctctgtaggt agtttgtcca attatgtcac 600 

accacagaag taaggttcct tcacaaagat cccaagctag cttataatac gactcactat 660 

agggagagag ctatgacgtc gcatgcacgc gtaagcttgg gcccctcgag ggatccgggt 720 

gtctcgctac gccgctacgt cttccgtgcc gtcctgggcg tcgtcttcgt cgtcgtcggt 780 

cggcggcttc gcccacgtga tcgaagcgcg cttctcgatg ggcgttccct gccccctgcc 840 

cgtagtcgac ttcgtgacaa cgatcttgtc tacgaagagc ccgacgaaca cgcgcttgtc 900 

gtctactgac gcgcgccccc accacgactt agggccggtc gggtcagcgt cggcgtcttc 960 

ggggaaccat tggtcaaggg gaagcttcgg ggcttcggcg gcttcaagtt cggcaagccg 1020 

ctcttccgcc ccttgctgcc ggagcgtcag cgctgcctgt tgcttccgga agtgcttcct 1080 

gccaacgggt ccgtcgtacg cgcctgccgc gcggtcttcg tacagctctt caagggcgtt 1140 

cagggcgtcg gcgcgctccg caacaaggtt cgcccgttcg ccgctcttct caggcgcctc 1200 

agtgagcttg ccgaagcgtc gggcggcttc ccacagaagc gccaacgtct cttcgtcgcc 1260 

ttcggcgtgc ctgatcttgt tgaagatgcg ttccgcaacg aacttgtcga gtgccgccat 1320 

gctgacgttg cacgtgcctt cgtgctgccc aggtgcggac gggtcgacca ccttccggcg 1380 
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acggcagcgg taagagtcct tgatcgattc ttccccgcgc ttcgaagtca tgacggcgcc 1440 

acactcgcag tacagcttgt ccatggcgga cagaatggct tgcccccggg aaagcccctt 1500 

gccgcgcccc ctgccgtcca accacgcctg aagctcatac cactcagcgg gctcgatgat 1560 

cggtccgcaa tcaagctcga ccggccggag cgtgatcggg tcgcgctgaa tgcggtaacc 1620 

ctcaatcttc gtggtcggcg tgccgtccgg cttcttcttg tagatcacct cagcggcgaa 1680 

gcccgcaata cgcgggtccc gaaggattcg cataacggtt gccgggtccc aggcgcttga 1740 

agcggtcttc ttcccaatcg tctcgccccg ggtcggcacg gcgtcagcgt ccatgcgctt 1800 

acaaagcccc gtgatgctgc ccgggtgaat ggcggcttga ctgcccggct tgaagggaag 1860 

gtgtttgtgc gtcttgatct cacgccacca ccaccggatt acgtcgggct cgaactcgaa 1920 

gggtccggta aggggagtgg tcgagtgcgc aagcttgttg atgacgacat tgaccattcg 1980 

gccgttgcgc gtgatctcct tcgtctccga aacaagctcg aagccgtaag gcgccttccc 2040 

gccgacgtac ccgcccaatt cgcgctgaag gttcttcgtg tcgagaatct tcgccgactt 2100 

cagcgaagat tctttgtgcg acgcgtcgag ccgcataatc aggtgaatca ggtccatgac 2160 

gtttccctgc cggaagacgc cttcctgagt ggaaacaatc gtcacgccca gggcgagcaa 2220 

ttccgagaca atcggaatcg cgtccatgac cttcaggcgc gagaagcgcg acacgtcata 2280 

gacaatgatc atgttgagcc gcccggcgcg gcattcgttc aggatgcgtt cgaactccgg 2340 

gcgctccgcc gtcccgaacg ccgacgtgcc cggcgcttcg ctgaaatgcc cgacgaacct 2400 

gaaccggccc ccgtcgcgct cgacttcgcg ctgaaggtcg gccgccttgt cttcgttggc 2460 

gctacgctgt gtcgctgggc ttgctgcgct cgaattctcg cgctcgcgcg actgacggtc 2520 

gtaagcaccc gcgtacgtgt ccaccccggt cacaacccct tgtgtcatgt cggcgaccct 2580 

acgactagtg agctcgtcga cccgggaatt ccggaccggt acctgcaggc gtaccttcta 2640 

tagtgtcacc taaatagctt tttgcaaaag cctaggctag agtccggagg ctggatcggt 2700 

cccggtgtct tctatggagg tcaaaacagc gtggatggcg tctccaggcg atctgacggt 2760 

tcactaaacg agctctgctt atatagacct cccaccgtac acgcctaccg cccatttgcg 2820 

tcaatggggc ggagttgtta cgacattttg gaaagtcccg ttgattttgg tgccaaaaca 2880 

aactcccatt gacgtcaatg gggtggagac ttggaaatcc ccgtgagtca aaccgctatc 2940 

cacgcccatt gatgtactgc caaaaccgca tcaccatggt aatagcgatg actaatacgt 3000 

agatgtactg ccaagtagga aagtcccata aggtcatgta ctgggcataa tgccaggcgg 3060 

gccatttacc gtcattgacg tcaatagggg gcgtacttgg catatgatac acttgatgta 3120 

ctgccaagtg ggcagtttac cgtaaatact ccacccattg acgtcaatgg aaagtcccta 3180 

ttggcgttac tatgggaaca tacgtcatta ttgacgtcaa tgggcggggg tcgttgggcg 3240 
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gtcagccagg cgggccattt accgtaagtt atgtaacgac ctgcacgatg ctgtttcctg 3300 

tgtgaaattg ttatccgctc acaattccac acattatacg agccggaagc tataaagtgt 3360 

aaagcctggg gtgcctaatg agtgaaaggg cctcgtatac gcctattttt ataggttaat 3420 

gtcatgataa taatggtttc ttagacgtca ggtggcactt ttcggggaaa tgtgcgcgga 3480 

acccctattt gtttattttt ctaaatacat tcaaatatgt atccgctcat gagacaataa 3540 

ccctgataaa tgcttcaata atattgaaaa acgcgcgaat tgcaagctct gcattaatga 3600 

atcggccaac gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc 3660 

actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg 3720 

gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc 3780 

cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc 3840 

ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga 3900 

ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc 3960 

ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcaa 4020 

tgctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg 4080 

cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc 4140 

aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga 4200 

gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact 4260 

agaaggacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt 4320 

ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag 4380 

cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg 4440 

tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgcc ataacttcgt 4500 

atagcataca ttatacgaag ttatggcatg agattatcaa aaaggatctt cacctagatc 4560 

cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct 4620 

gacagttacc aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca 4680 

tccatagttg cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct 4740 

ggccccagtg ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca 4800 

ataaaccagc cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc 4860 

atccagtcta ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg 4920 

cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct 4980 

tcattcagct ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa 5040 

aaagcggtta gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta 5100 

tcactcatgg ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc 5160 
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ttttctgtga 


ctggtgagta 


ctcaaccaag 


tcattctgag aatagtgtat 


gcggcgaccg 


5220 


agttgctctt 


gcccggcgtc 


aatacgggat 


aataccgcgc cacatagcag 


aactttaaaa 


5280 


gtgctcatca 


ttggaaaacg 


ttcttcgggg 


cgaaaactct caaggatctt 


accgctgttg 


5340 


agatccagtt 


cgatgtaacc 


cactcgtgca 


cccaactgat cttcagcatc 


ttttactttc 


5400 


accagcgttt 


ctgggtgagc 


aaaaacagga 


aggcaaaatg ccgcaaaaaa 


gggaataagg 


5460 


gcgacacgga 


aatgttgaat 


actcatactc 


ttcctttttc aatattattg 


aagcatttat 


5520 


cagggttatt 


gtctcatgcc 


aggggtgggc 


acacatattt gataccagcg 


atccctacac 


, 5580 


agcacataat 


tcaatgcgac 


ttccctctat 


cgcacatctt agacctttat 


tctccctcca 


5640 


gcacacatcg 


aagctgccga 


gcaagccgtt 


ctcaccagtc caagacctgg 


catgagcgga 


5700 


tacatatttg 


aatgtattta 


gaaaaataaa 


caaatagggg ttccgcgcac 


atttccccga 


5760 


aaagtgccac 


ctgaaattgt 


aaacgttaat 


attttgttaa aattcgcgtt 


aaatttttgt 


5820 


taaatcagct 


cattttttaa 


ccaataggcc gaaatcggca aaatccctta taaatcaaaa 


5880 


gaatagaccg 


agatagggtt 


gagtgttgtt ccagtttgga acaagagtcc actattaaag 


5940 


aacgtggact 


ccaacgtcaa 


agggcgaaaa 


accgtctatc agggcgatgg 


cccactacgt 


6000 


gaaccatcac 


cctaatcaag 


ttttttgggg 


tcgaggtgcc gtaaagcact 


aaatcggaac 


6060 


cctaaaggga 


gcccccgatt 


tagagcttga 


cggggaaagc cggcgaacgt 


ggcgagaaag 


6120 


gaagggaaga 


aagcgaaagg 


agcgggcgct 


agggcgctgg caagtgtagc 


ggtcacgctg 


6180 


cgcgtaacca 


ccacacccgc 


cgcgcttaat 


gcgccgctac agggcgcgtc 




6230 


<210> 2 
<211> 5982 
<212> DNA 

<213> Plasmid pCMV-luc-attB 








<400> 2 
ctctatcgat 


aggtaccgag 


ctcttacgcg 


tgctagccct cgagcaggat 


ctatacattg 


60 


aatcaatatt 


ggcaattagc 


cat at tag tc 


attggttata tagcataaat 


caatattggc 


120 


tattggccat 


tgcatacgtt 


gtatctatat 


cataatatgt acatttatat 


tggctcatgt 


180 


ccaatatgac 


cgccatgttg acattgatta ttgactagtt attaatagta atcaattacg 


240 


gggtcattag 


ttcatagccc 


atatatggag 


ttccgcgtta cataacttac 


ggtaaatggc 


300 


ccgcctggct 


gaccgcccaa 


cgacccccgc 


ccattgacgt caataatgac 


gtatgttccc 


360 


atagtaacgc caatagggac tttccattga cgtcaatggg tggagtattt acggtaaact 


420 


gcccacttgg 


cagtacatca 


agtgtatcat 


atgccaagtc cgccccctat 


tgacgtcaat 


480 


gacggtaaat 


ggcccgcctg 


gcattatgcc 


cagtacatga ccttacggga 


ctttcctact 


540 


tggcagtaca 


tctacgtatt 


agtcatcgct 


attaccatgg tgatgcggtt 
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atcaatgggc 


gtggatagcg 


gtttgactca 


cggggatttc 


caagtctcca 


ccccattgac 


660 


gtcaatggga 


gtttgttttg 


gcaccaaaat 


caacgggact 


ttccaaaatg 


tcgtaacaac 


720 


tccgccccat 


tgacgcaaat 


gggcggtagg 


cgtgtacggt 


gggaggtcta 


tataagcaga 


780 


gctcgtttag 


tgaaccgtca 


gatcgcctgg 


agacgccatc 


cacgctgttt 


tgacctccat 


840 


agaagacacc 


gggaccgatc 


cagcctcccc 


tcgaagctcg 


actctagggg 


ctcgagatct 


900 


gcgatctaag 


taagcttggc 


attccggtac 


tgttggtaaa 


gccaccatgg 


aagacgccaa 


960 


aaacataaag 


aaaggcccgg 


cgccattcta 


tccgctggaa 


gatggaaccg 


ctggagagca 


1020 


actgcataag 


gctatgaaga 


gatacgccct 


ggttcctgga 


acaattgctt 


ttacagatgc 


1080 


acatatcgag 


gtggacatca 


cttacgctga 


gtacttcgaa 


atgtccgttc 


ggttggcaga 


1140 


agctatgaaa 


cgatatgggc 


tgaatacaaa 


tcacagaatc 


gtcgtatgca 


gtgaaaactc 


1200 


tcttcaattc 


tttatgccgg 


tgttgggcgc 


gttatttatc 


ggagttgcag 


ttgcgcccgc 


1260 


gaacgacatt 


tataatgaac 


gtgaattgct 


caacagtatg 


ggcatttcgc 


agcctaccgt 


1320 


ggtgttcgtt 


tccaaaaagg 


ggttgcaaaa 


aattttgaac 


gtgcaaaaaa 


agctcccaat 


1380 


catccaaaaa 


attattatca 


tggattctaa 


aacggattac 


cagggatttc 


agtcgatgta 


1440 


cacgttcgtc 


acatctcatc 


tacctcccgg 


ttttaatgaa 


tacgattttg 


tgccagagtc 


1500 


cttcgatagg 


gacaagacaa 


ttgcactgat 


catgaactcc 


tctggatcta 


ctggtctgcc 


1560 


taaaggtgtc 


gctctgcctc 


atagaactgc 


ctgcgtgaga 


ttctcgcatg 


ccagagatcc 


1620 


tatttttggc 


aatcaaatca 


ttccggatac 


tgcgatttta 


agtgttgttc 


cattccatca 


1680 


cggttttgga 


atgtttacta 


cactcggata 


tttgatatgt 


ggatttcgag 


tcgtcttaat 


1740 


gtatagattt 


gaagaagagc 


tgtttctgag 


gagccttcag 


gattacaaga 


ttcaaagtgc 


1800 


gctgctggtg 


ccaaccctat 


tctccttctt 


cgccaaaagc 


actctgattg 


acaaatacga 


1860 


tttatctaat 


ttacacgaaa 


ttgcttctgg 


tggcgctccc 


ctctctaagg 


aagtcgggga 


1920 


agcggttgcc 


aagaggttcc 


atctgccagg 


tatcaggcaa 


ggatatgggc 


tcactgagac 


1980 


tacatcagct 


attctgatta 


cacccgaggg 


ggatgataaa 


ccgggcgcgg 


tcggtaaagt 


2040 


tgttccattt 


tttgaagcga 


aggttgtgga 


tctggatacc 


gggaaaacgc 


tgggcgttaa 


2100 


tcaaagaggc 


gaactgtgtg 


tgagaggtcc 


tatgattatg 


tccggttatg 


taaacaatcc 


2160 


ggaagcgacc 


aacgccttga 


ttgacaagga 


tggatggcta 


cattctggag 


acatagctta 


2220 


ctgggacgaa 


gacgaacact 


tcttcatcgt 


tgaccgcctg 


aagtctctga 


ttaagtacaa 


2280 


aggctatcag 


gtggctcccg 


ctgaattgga 


atccatcttg 


ctccaacacc 


ccaacatctt 


2340 


cgacgcaggt 


gtcgcaggtc 


ttcccgacga 


tgacgccggt 


gaacttcccg 


ccgccgttgt 


2400 


tgttttggag 


cacggaaaga 


cgatgacgga 


aaaagagatc 
Page 


gtggattacg 
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tcgccagtca 


2460 
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agtaacaacc gcgaaaaagt tgcgcggagg agttgtgttt gtggacgaag taccgaaagg 2520 

tcttaccgga aaactcgacg caagaaaaat cagagagatc ctcataaagg ccaagaaggg 2580 

cggaaagatc gccgtgtaat tctagagtcg gggcggccgg ccgcttcgag cagacatgat 2640 

aagatacatt gatgagtttg gacaaaccac aactagaatg cagtgaaaaa aatgctttat 2700 

ttgtgaaatt tgtgatgcta ttgctttatt tgtaaccatt ataagctgca ataaacaagt 2760 

taacaacaac aattgcattc attttatgtt tcaggttcag ggggaggtgt gggaggtttt 2820 

ttaaagcaag taaaacctct acaaatgtgg taaaatcgat aaggatcaat tcggcttcag 2880 

gtaccgtcga cgatgtaggt cacggtctcg aagccgcggt gcgggtgcca gggcgtgccc 2940 

ttgggctccc cgggcgcgta ctccacctca cccatctggt ccatcatgat gaacgggtcg 3000 

aggtggcggt agttgatccc ggcgaacgcg cggcgcaccg ggaagccctc gccctcgaaa 3060 

ccgctgggcg cggtggtcac ggtgagcacg ggacgtgcga cggcgtcggc gggtgcggat 3120 

acgcggggca gcgtcagcgg gttctcgacg gtcacggcgg gcatgtcgac agccgaattg 3180 

atccgtcgac cgatgccctt gagagccttc aacccagtca gctccttccg gtgggcgcgg 3240 

ggcatgacta tcgtcgccgc acttatgact gtcttcttta tcatgcaact cgtaggacag 3300 

gtgccggcag cgctcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct 3360 

gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga 3420 

taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc 3480 

cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg 3540 

ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg 3600 

aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt 3660 

tctcccttcg ggaagcgtgg cgctttctca atgctcacgc tgtaggtatc tcagttcggt 3720 

gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg 3780 

cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact 3840 

ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt 3900 

cttgaagtgg tggcctaact acggctacac tagaaggaca gtatttggta tctgcgctct 3960 

gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac 4020 

cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc 4080 

tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg 4140 

ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta 4200 

aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca 4260 

atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc 4320 

ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc 4380 

Page 6 



wo 2004/092351 



49/87 



PCT/US2004/009253 



025CIP SEQ List.txt 

tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc 4440 

agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat 4500 

taattgttgc cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt 4560 

tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc 4620 

cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag 4680 

ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt 4740 

tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac 4800 

tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg 4860 

cccggcgtca atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat 4920 

tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc 4980 

gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc 5040 

tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa 5100 

atgttgaata ctcatactct tcctttttca atattattga agcatttatc agggttattg 5160 

tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg 5220 

cacatttccc cgaaaagtgc cacctgacgc gccctgtagc ggcgcattaa gcgcggcggg 5280 

tgtggtggtt acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt 5340 

cgctttcttc ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg 5400 

ggggctccct ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga 5460 

ttagggtgat ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac 5520 

gttggagtcc acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc 5580 

tatctcggtc tattcttttg atttataagg gattttgccg atttcggcct attggttaaa 5640 

aaatgagctg atttaacaaa aatttaacgc gaattttaac aaaatattaa cgtttacaat 5700 

ttcccattcg ccattcaggc tgcgcaactg ttgggaaggg cgatcggtgc gggcctcttc 5760 

gctattacgc cagcccaagc taccatgata agtaagtaat attaaggtac gggaggtact 5820 

tggagcggcc gcaataaaat atctttattt tcattacatc tgtgtgttgg ttttttgtgt 5880 

gaatcgatag tactaacata cgctctccat caaaacaaaa cgaaacaaaa caaactagca 5940 

aaataggctg tccccagtgc aagtgcaggt gccagaacat tt 5982 

<210> 3 
<211> 5924 
<212> DNA 

<213> Plasttiid pCMV-luc-attP 
<400> 3 

ctctatcgat aggtaccgag ctcttacgcg tgctagccct cgagcaggat ctatacattg 60 
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aatcaatatt ggcaattagc catattagtc attggttata tagcataaat caatattggc 120 

tattggccat tgcatacgtt gtatctatat cataatatgt acatttatat tggctcatgt 180 

ccaatatgac cgccatgttg acattgatta ttgactagtt attaatagta atcaattacg 240 

gggtcattag ttcatagccc atatatggag ttccgcgtta cataacttac ggtaaatggc 300 

ccgcctggct gaccgcccaa cgacccccgc ccattgacgt caataatgac gtatgttccc 360 

atagtaacgc caatagggac tttccattga cgtcaatggg tggagtattt acggtaaact 420 

gcccacttgg cagtacatca agtgtatcat atgccaagtc cgccccctat tgacgtcaat 480 

gacggtaaat ggcccgcctg gcattatgcc cagtacatga ccttacggga ctttcctact 540 

tggcagtaca tctacgtatt agtcatcgct attaccatgg tgatgcggtt ttggcagtac 600 

atcaatgggc gtggatagcg gtttgactca cggggatttc caagtctcca ccccattgac 660 

gtcaatggga gtttgttttg gcaccaaaat caacgggact ttccaaaatg tcgtaacaac 720 

tccgccccat tgacgcaaat gggcggtagg cgtgtacggt gggaggtcta tataagcaga 780 

gctcgtttag tgaaccgtca gatcgcctgg agacgccatc cacgctgttt tgacctccat 840 

agaagacacc gggaccgatc cagcctcccc tcgaagctcg actctagggg ctcgagatct 900 

gcgatctaag taagcttggc attccggtac tgttggtaaa gccaccatgg aagacgccaa 960 

aaacataaag aaaggcccgg cgccattcta tccgctggaa gatggaaccg ctggagagca 1020 

actgcataag gctatgaaga gatacgccct ggttcctgga acaattgctt ttacagatgc 1080 

acatatcgag gtggacatca cttacgctga gtacttcgaa atgtccgttc ggttggcaga 1140 

agctatgaaa cgatatgggc tgaatacaaa tcacagaatc gtcgtatgca gtgaaaactc 1200 

tcttcaattc tttatgccgg tgttgggcgc gttatttatc ggagttgcag ttgcgcccgc 1260 

gaacgacatt tataatgaac gtgaattgct caacagtatg ggcatttcgc agcctaccgt 1320 

ggtgttcgtt tccaaaaagg ggttgcaaaa aattttgaac gtgcaaaaaa agctcccaat 1380 

catccaaaaa attattatca tggattctaa aacggattac cagggatttc agtcgatgta 1440 

cacgttcgtc acatctcatc tacctcccgg ttttaatgaa tacgattttg tgccagagtc 1500 

cttcgatagg gacaagacaa ttgcactgat catgaactcc tctggatcta ctggtctgcc 1560 

taaaggtgtc gctctgcctc atagaactgc ctgcgtgaga ttctcgcatg ccagagatcc 1620 

tatttttggc aatcaaatca ttccggatac tgcgatttta agtgttgttc cattccatca 1680 

cggttttgga atgtttacta cactcggata tttgatatgt ggatttcgag tcgtcttaat 1740 

gtatagattt gaagaagagc tgtttctgag gagccttcag gattacaaga ttcaaagtgc 1800 

gctgctggtg ccaaccctat tctccttctt cgccaaaagc actctgattg acaaatacga 1860 

tttatctaat ttacacgaaa ttgcttctgg tggcgctccc ctctctaagg aagtcgggga 1920 
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agcggttgcc 


aagaggttcc 


atctgccagg 


025CIP SEQ 
tatcaggcaa 


List .txt 
ggatatgggc 


tcactgagac 


1980 


tacatcagct 


attctgatta 


cacccgaggg 


ggatgataaa 


ccgggcgcgg 


tcggtaaagt 


2040 


tgttccattt 


tttgaagcga 


aao't'ta'taQa tctaaatacc 


gggaaaacgc 


tgggcgttaa 


2100 


tcaaagaggc 


gaactgtgtg 


tgagaggtcc tatgattatg 


tccggttatg 


taaacaatcc 


2160 


ggaagcgacc 


aacgccttga 


ttgacaagga tggatggcta 


cattctggag 


acatagctta 


2220 


ctgggacgaa 


gacgaacact 


tcttcatcgt tgaccgcctg 


aagtctctga 


ttaagtacaa 


2280 


aggctatcag 


gtggctcccg 


ctgaattgga atccatcttg 


ctccaacacc 


ccaacatctt 


2340 


cgacgcaggt 


gtcgcaggtc 


ttcccgacga tgacgccggt 


gaacttcccg 


ccgccgttgt 


2400 


tgttttggag 


cacggaaaga 


cgatgacgga aaaagagatc 


gtggattacg 


tcgccagtca 


2460 


agtaacaacc 


gcgaaaaagt 




dy I. Lg Lg L L L 


gtggacgaag 


taccgaaagg 


2520 


tcttaccgga 


aaactcgacg 






ctcataaagg 


ccaagaaggg 


2580 


cggaaagatc 


gccgtgtaat 


tctagagtcg 


gggcggccgg 


ccgcttcgag 


cagacatgat 


2640 


aagatacatt 


gatgagtttg 


gacaaaccac 


aactagaatg 


cagtgaaaaa 


aatgctttat 


2700 


ttgtgaaatt 


tgtgatgcta 


ttgctttatt 


tgtaaccatt 


ataagctgca 


ataaacaagt 


2760 


taacaacaac 


aattgcattc 


attttatgtt 


tcaggttcag 


ggggaggtgt 


gggaggtttt 


2820 


ttaaagcaag 


taaaacctct 


acaaatgtgg 


taaaatcgat 


aaggatcaat 


tcggcttcga 


2880 


ctagtactga 


cggacacacc 


gaagccccgg 


cggcaaccct 


cagcggatgc 


cccggggctt 


2940 


cacgttttcc 


caggtcagaa 


gcggttttcg 


ggagtagtgc 


cccaactggg 


gtaacctttg 


3000 


agttctctca 


gttgggggcg 


tagggtcgcc 


gacatgacac 


aaggggttgt 


gaccggggtg 


3060 


gacacgtacg 


cgggtgctta 


cgaccgtcag 


tcgcgcgagc 


gcgactagta 


caagccgaat 


3120 


tgatccgtcg 


accgatgccc 


ttgagagcct 


tcaacccagt 


cagctccttc 


cggtgggcgc 


3180 


ggggcatgac 


tatcgtcgcc 


gcacttatga 


ctgtcttctt 


tatcatgcaa 


ctcgtaggac 


3240 


aggtgccggc 


agcgctcttc 


cgcttcctcg 


ctcactgact 


cgctgcgctc 


ggtcgttcgg 


3300 


ctgcggcgag 


cggtatcagc 


tcactcaaag 


gcggtaatac 


ggttatccac 


agaatcaggg 


3360 


gataacgcag 


gaaagaacat 


gtgagcaaaa ggccagcaaa 


aggccaggaa 


ccgtaaaaag 


3420 


gccgcgttgc 


tggcgttttt 


ccataggctc 


cgcccccctg 


acgagcatca 


caaaaatcga 


3480 


cgctcaagtc 


agaggtggcg 


aaacccgaca 


ggactataaa 


gataccaggc 


gtttccccct 


3540 


ggaagctccc 


tcgtgcgctc 


tcctgttccg 


accctgccgc 


ttaccggata 


cctgtccgcc 


3600 


tttctccctt 


cgggaagcgt 


ggcgctttct 


caatgctcac 


gctgtaggta 


tctcagttcg 


3660 


gtgtaggtcg 


ttcgctccaa 


gctgggctgt 


gtgcacgaac 


cccccgttca 


gcccgaccgc 


3720 


tgcgccttat 


ccggtaacta 


tcgtcttgag 


tccaacccgg 


taagacacga 


cttatcgcca 


3780 


ctggcagcag 


ccactggtaa 


caggattagc 


agagcgaggt 
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tgctacagag 


3840 



wo 2004/092351 



52/87 



PCT/US2004/009253 









025CIP SEQ 


List.txt 






ttcttgaagt 


ggtggcctaa 


ctacggctac 


actagaagga 


cagtatttgg 


tatctgcgct 


3900 


ctgctgaagc 


cagttacctt 


cggaaaaaga 


gttggtagct 


cttgatccgg 


caaacaaacc 


3960 


accgctggta 


gcggtggttt 


ttttgtttgc 


aagcagcaga 


ttacgcgcag 


aaaaaaagga 


4020 


tctcaagaag 


atcctttgat 


cttttctacg 


gggtctgacg 


ctcagtggaa 


cgaaaactca 


4080 


cgttaaggga 


ttttggtcat gagattatca aaaaggatct tcacctagat 


ccttttaaat 


4140 


taaaaatgaa 


gttttaaatc aatctaaagt 


atatatgagt 


aaacttggtc 


tgacagttac 


4200 


caatgcttaa 


tcagtgaggc acctatctca gcgatctgtc tatttcgttc 


atccatagtt 


4260 


gcctgactcc 


ccgtcgtgta gataactacg 


atacgggagg 


gcttaccatc 


tgqccccaqt 


4320 


gctgcaatga 


taccgcgaga 


cccacgctca 


ccggctccag 


atttatcagc 


aataaaccag 


4380 


ccagccggaa 


gggccgagcg 


cagaagtggt 


cctgcaactt 


tatccgcctc 


catccagtct 


4440 


attaattgtt 


gccgggaagc tagagtaagt 


agttcgccag 


ttaatagttt 


gcgcaacgtt 


4500 


gttgccattg 


ctacaggcat 


cgtggtgtca 


cgctcgtcgt 


ttggtatggc 


ttcattcagc 


4560 


tccggttccc 


aacgatcaag gcgagttaca tgatccccca tgttgtgcaa 


aaaagcggtt 


4620 


agctccttcg 


gtcctccgat 


cgttgtcaga agtaagttgg 


ccgcagtgtt 


atcactcatg 


4680 


gttatggcag 


cactgcataa 


ttctcttact 


gtcatgccat 


ccgtaagatg 


cttttctgtg 


4740 


actggtgagt 


actcaaccaa 


gtcattctga gaatagtgta tgcggcgacc 


gagttgctct 


4800 


tgcccggcgt 


caatacggga taataccgcg ccacatagca gaactttaaa 


agtgctcatc 


4860 


attggaaaac 


gttcttcggg 


gcgaaaactc 


tcaaggatct 


taccgctgtt 


gagatccagt 


4920 


tcgatgtaac 


ccactcgtgc 


acccaactga tcttcagcat 


cttttacttt 


caccagcgtt 


4980 


tctgggtgag 


caaaaacagg 


aaggcaaaat 


gccgcaaaaa 


agggaataag 


ggcgacacqq 


5040 


aaatgttgaa 


tactcatact 


cttccttttt 


caatattatt 


gaagcattta 


tcagggttat 


5100 


tgtctcatga 


gcggatacat 


atttgaatgt 


atttagaaaa 


ataaacaaat 


aqqqqttccq 


5160 


cgcacatttc 


cccgaaaagt 


gccacctgac 


gcgccctgta 


gcggcgcatt 


aagcqcqqcq 


5220 


ggtgtggtgg 


ttacgcgcag 


cgtgaccgct 


acacttgcca 


gcgccctagc 


gcccgctcct 


5280 


ttcgctttct 


tcccttcctt 


tctcgccacg 


ttcgccggct 


ttccccgtca 


agctctaaat 


5340 


cgggggctcc 


ctttagggtt 


ccgatttagt 


gctttacggc 


acctcgaccc 


caaaaaactt 


5400 


gattagggtg 


atggttcacg 


tagtgggcca 


tcgccctgat 


agacggtttt 


1" ^~ n r" f" f"f" 1" 1" /I 




acgttggagt 


ccacgttctt 


taatagtgga 


ctcttgttcc 


aaactggaac 


aacactcaac 


5520 


cctatctcgg 


tctattcttt 


tgatttataa 


gggattttgc 


cgatttcggc 


ctattggtta 


5580 


aaaaatgagc 


tgatttaaca 


aaaatttaac 


gcgaatttta 


acaaaatatt 


aacgtttaca 


5640 


atttcccatt 


cgccattcag 


gctgcgcaac tgttgggaag 


ggcgatcggt 


gcgggcctct 


5700 
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tcgctattac 


gccagcccaa gctaccatga 


taagtaagta 


atattaaggt 


acgggaggta 


5760 


cttggagcgg 


ccgcaataaa 


atatctttat 


tttcattaca 


tctgtgtgtt 


ggttttttgt 


5820 


gtgaatcgat 


agtactaaca 


tacgctctcc 


atcaaaacaa 


aacgaaacaa 


aacaaactag 


5880 


caaaataggc 


tgtccccagt 


gcaagtgcag 


g-tgccagaac 


attt 






<210> 4 
<211> 5101 
<212> DNA 

<213> Plasmid pCMV-pur-attB 










<400> 4 
ctagagtcgg 


ggcggccggc 


cgcttcgagc 


agacatgata 


agatacattg 


atgagtttgg 


60 


acaaaccaca 


actagaatgc 


agtgaaaaaa 


atgctttatt 


tgtgaaattt 


gtgatgctat 


120 


tgctttattt 


gtaaccatta 


taagctgcaa 


taaacaagtt 


aacaacaaca 


attgcattca 


180 


ttttatgttt 


caggttcagg 


gggaggtgtg 


ggaggttttt 


taaagcaagt 


aaaacctcta 


240 


caaatgtggt 


aaaatcgata 


aggatcaatt 


cggcttcagg 


taccgtcgac 


gatgtaggtc 


300 


acggtctcga 


agccgcggtg 


cgggtgccag 


ggcgtgccct 


tgggctcccc 


gggcgcgtac 


360 


tccacctcac 


ccatctggtc 


catcatgatg 


aacgggtcga 


ggtggcggta 


gttgatcccg 


420 


gcgaacgcgc 


ggcgcaccgg 


gaagccctcg 


ccctcgaaac 


cgctgggcgc 


ggtggtcacg 


480 


gtgagcacgg 


gacgtgcgac 


ggcgtcggcg 


ggtgcggata 


cgcggggcag 


cgtcagcggg 


540 


ttctcgacgg 


tcacggcggg 


catgtcgaca 


gccgaattga 


tccgtcgacc 


gatgcccttg 


600 


agagccttca 


acccagtcag 


ctccttccgg 


tgggcgcggg 


gcatgactat 


cgtcgccgca 


660 


cttatgactg 


tcttctttat 


catgcaactc 


gtaggacagg 


tgccggcagc 


gctcttccgc 


720 


ttcctcgctc 


actgactcgc 


tgcgctcggt 


cgttcggctg 


cggcgagcgg 


tatcagctca 


780 


ctcaaaggcg 


gtaatacggt 


tatccacaga 


atcaggggat 


aacgcaggaa 


agaacatgtg 


840 


agcaaaaggc 


cagcaaaagg 


ccaggaaccg 


taaaaaggcc 


gcgttgctgg 


cgtttttcca 


900 


taggctccgc 


ccccctgacg 


agcatcacaa 


aaatcgacgc 


tcaagtcaga 


ggtggcgaaa 


960 


cccgacagga 


ctataaagat 


accaggcgtt 


tccccctgga 


agctccctcg 


tgcgctctcc 


1020 


tgttccgacc 


ctgccgctta 


ccggatacct 


gtccgccttt 


ctcccttcgg 


gaagcgtggc 


1080 


gctttctcaa 


tgctcacgct 


gtaggtatct 


cagttcggtg 


taggtcgttc 


gctccaagct 


1140 


gggctgtgtg 


cacgaacccc 


ccgttcagcc 


cgaccgctgc 


gccttatccg 


gtaactatcg 


1200 


tcttgagtcc 


aacccggtaa 


gacacgactt 


atcgccactg 


gcagcagcca 


ctggtaacag 


1260 


gattagcaga 


gcgaggtatg 


taggcggtgc 


tacagagttc 


ttgaagtggt 


ggcctaacta 


1320 


cggctacact 


agaaggacag 


tatttggtat 


ctgcgctctg 


ctgaagccag 


ttaccttcgg 


1380 


aaaaagagtt 


ggtagctctt 


gatccggcaa 


acaaaccacc 
Page 


gctggtagcg 
11 


gtggtttttt 


1440 
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tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt 1500 

ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag 1560 

attatcaaaa aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat 1620 

ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc 1680 

tatctcagcg atctgtctat ttcgttcatc catagttgcc tgactccccg tcgtgtagat 1740 

aactacgata cgggagggct taccatctgg ccccagtgct gcaatgatac cgcgagaccc 1800 

acgctcaccg gctccagatt tatcagcaat aaaccagcca gccggaaggg ccgagcgcag 1860 

aagtggtcct gcaactttat ccgcctccat ccagtctatt aattgttgcc gggaagctag 1920 

agtaagtagt tcgccagtta atagtttgcg caacgttgtt gccattgcta caggcatcgt 1980 

ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc ggttcccaac gatcaaggcg 2040 

agttacatga tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt 2100 

tgtcagaagt aagttggccg cagtgttatc actcatggtt atggcagcac tgcataattc 2160 

tcttactgtc atgccatccg taagatgctt ttctgtgact ggtgagtact caaccaagtc 2220 

attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa 2280 

taccgcgcca catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg 2340 

aaaactctca aggatcttac cgctgttgag atccagttcg atgtaaccca ctcgtgcacc 2400 

caactgatct tcagcatctt ttactttcac cagcgtttct gggtgagcaa aaacaggaag 2460 

gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa tgttgaatac tcatactctt 2520 

cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg gatacatatt 2580 

tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc 2640 

acctgacgcg ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt 2700 

gaccgctaca cttgccagcg ccctagcgcc cgctcctttc gctttcttcc cttcctttct 2760 

cgccacgttc gccggctttc cccgtcaagc tctaaatcgg gggctccctt tagggttccg 2820 

atttagtgct ttacggcacc tcgaccccaa aaaacttgat tagggtgatg gttcacgtag 2880 

tgggccatcg ccctgataga cggtttttcg ccctttgacg ttggagtcca cgttctttaa 2940 

tagtggactc ttgttccaaa ctggaacaac actcaaccct atctcggtct attcttttga 3000 

tttataaggg attttgccga tttcggccta ttggttaaaa aatgagctga tttaacaaaa 3060 

atttaacgcg aattttaaca aaatattaac gtttacaatt tcccattcgc cattcaggct 3120 

gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg ctattacgcc agcccaagct 3180 

accatgataa gtaagtaata ttaaggtacg ggaggtactt ggagcggccg caataaaata 3240 

tctttatttt cattacatct gtgtgttggt tttttgtgtg aatcgatagt actaacatac 3300 

gctctccatc aaaacaaaac gaaacaaaac aaactagcaa aataggctgt ccccagtgca 3360 
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agtgcaggtg ccagaacatt tctctatcga taggtaccga gctcttacgc gtgctagccc 3420 

tcgagcagga tctatacatt gaatcaatat tggcaattag ccatattagt cattggttat 3480 

atagcataaa tcaatattgg ctattggcca ttgcatacgt tgtatctata tcataatatg 3540 

tacatttata ttggctcatg tccaatatga ccgccatgtt gacattgatt attgactagt 3600 

tattaatagt aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt 3660 

acataactta cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg 3720 

tcaataatga cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg 3780 

gtggagtatt tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt 3840 

ccgcccccta ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg 3900 

accttacggg actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg 3960 

gtgatgcggt tttggcagta catcaatggg cgtggatagc ggtttgactc acggggattt 4020 

ccaagtctcc accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac 4080 

tttccaaaat gtcgtaacaa ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg 4140 

tgggaggtct atataagcag agctcgttta gtgaaccgtc agatcgcctg gagacgccat 4200 

ccacgctgtt ttgacctcca tagaagacac cgggaccgat ccagcctccc ctcgaagctc 4260 

gactctaggg gctcgagatc tgcgatctaa gtaagcttgc atgcctgcag gtcggccgcc 4320 

acgaccggtg ccgccaccat cccctgaccc acgcccctga cccctcacaa ggagacgacc 4380 

ttccatgacc gagtacaagc ccacggtgcg cctcgccacc cgcgacgacg tcccccgggc 4440 

cgtacgcacc ctcgccgccg cgttcgccga ctaccccgcc acgcgccaca ccgtcgaccc 4500 

ggaccgccac atcgagcggg tcaccgagct gcaagaactc ttcctcacgc gcgtcgggct 4560 

cgacatcggc aaggtgtggg tcgcggacga cggcgccgcg gtggcggtct ggaccacgcc 4620 

ggagagcgtc gaagcggggg cggtgttcgc cgagatcggc ccgcgcatgg ccgagttgag 4680 

:ggttcccgg ctggccgcgc agcaacagat ggaaggcctc ctggcgccgc accggcccaa 4740 

jgagcccgcg tggttcctgg ccaccgtcgg cgtctcgccc gaccaccagg gcaagggtct 4800 

iggcagcgcc gtcgtgctcc ccggagtgga ggcggccgag cgcgccgggg tgcccgcctt 4860 

xtggagacc tccgcgcccc gcaacctccc cttctacgag cggctcggct tcaccgtcac 4920 

:gccgacgtc gaggtgcccg aaggaccgcg cacctggtgc atgacccgca agcccggtgc 4980 

tgacgcccg ccccacgacc cgcagcgccc gaccgaaagg agcgcacgac cccatggctc 5040 

gaccgaagc cgacccgggc ggccccgccg accccgcacc cgcccccgag gcccaccgac 5100 

5101 

210> 5 
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<211> 5043 
<212> DNA 

<213> Plasmid pCMV-pur-attP 
<400> 5 

ctagagtcgg ggcggccggc cgcttcgagc agacatgata agatacattg atgagtttgg 60 

acaaaccaca actagaatgc agtgaaaaaa atgctttatt tgtgaaattt gtgatgctat 120 

tgctttattt gtaaccatta taagctgcaa taaacaagtt aacaacaaca attgcattca 180 

ttttatgttt caggttcagg gggaggtgtg ggaggttttt taaagcaagt aaaacctcta 240 

caaatgtggt aaaatcgata aggatcaatt cggcttcgac tagtactgac ggacacaccg 300 

aagccccggc ggcaaccctc agcggatgcc ccggggcttc acgttttccc aggtcagaag 360 

cggttttcgg gagtagtgcc ccaactgggg taacctttga gttctctcag ttgggggcgt 420 

agggtcgccg acatgacaca aggggttgtg accggggtgg acacgtacgc gggtgcttac 480 

gaccgtcagt cgcgcgagcg cgactagtac aagccgaatt gatccgtcga ccgatgccct 540 

tgagagcctt caacccagtc agctccttcc ggtgggcgcg gggcatgact atcgtcgccg 600 

cacttatgac tgtcttcttt atcatgcaac tcgtaggaca ggtgccggca gcgctcttcc 660 

gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct 720 

cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg 780 

tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc 840 

cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga 900 

aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct 960 

cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg 1020 

gcgctttctc aatgctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag 1080 

ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat 1140 

cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac 1200 

aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac 1260 

tacggctaca ctagaaggac agtatttggt atctgcgctc tgctgaagcc agttaccttc 1320 

ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt 1380 

tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc 1440 

ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg 1500 

agattatcaa aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca 1560 

atctaaagta tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca 1620 

cctatctcag cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag 1680 

ataactacga tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac 1740 
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ccacgctcac 


cggctccaga 


025CIP SEQ 
tttatcagca ataaaccagc 


List.txt 
cagccggaag 


ggccgagcgc 


1800 


agaagtggtc 


ctgcaacttt 


atccgcctcc atccagtcta 


ttaattgttg 


ccgggaagct 


1860 


agagtaagta 


gttcgccagt 


taatagtttg cgcaacgttg 


ttgccattgc 


tacaggcatc 


1920 


gtggtgtcac 


gctcgtcgtt 


tggtatggct tcattcagct 


ccggttccca 


acgatcaagg 


1980 


cgagttacat 


gatcccccat 


gttgtgcaaa aaagcggtta 


gctccttcgg 


tcctccgatc 


2040 


gttgtcagaa 


gtaagttggc 


cgcagtgtta tcactcatgg 


ttatggcagc 


actgcatsat 


2100 


tctcttactg 


tcatgccatc 


cgtaagatgc ttttctgtga 


ctggtgagta 


ctcaaccaag 


2160 


tcattctgag 


aatagtgtat 


gcggcgaccg agttgctctt 


qcccqqcqtc 


aatacgggat 


2220 


aataccgcgc 


cacatagcag 


aactttaaaa gtgctcatca 


ttggaaaacg 


ttcttcqqqq 


2280 


cgaaaactct 


caaggatctt 


accgctgttg agatccagtt 


cgatgtaacc 


cactcgtgca 


2340 


cccaactgat 


cttcagcatc 


ttttactttc accagcgttt 


ctgqgtqagc 


aaaaacagga 


2400 


aggcaaaatg 


ccgcaaaaaa 


gggaataagg gcgacacgga 


aatgttgaat 


actcatactc 


2460 


ttcctttttc 


aatattattg 


aagcatttat cagggttatt 


gtctcatgag 


cggatacata 


2520 


tttgaatgta 


tttagaaaaa 


taaacaaata ggggttccgc 


gcacatttcc 


ccgaaaagtg 


2580 


ccacctgacg 


cgccctgtag 


cggcgcatta agcgcggcgg 


gtgtggtggt 


tacqcqcaqc 


2640 


gtgaccgcta 


cacttgccag 


cgccctagcg cccgctcctt 


tcgctttctt 


cccttccttt 


2700 


ctcgccacgt 


tcgccggctt 


tccccgtcaa gctctaaatc 




tttaqqqttc 


2760 


cgatttagtg 


ctttacggca 


cctcgacccc aaaaaacttg 


attagggtga 


tqqttcacqt 


2820 


agtgggccat 


cgccctgata 


gacggttttt cgccctttga 


cgttggagtc 


cacgttcttt 


2880 


aatagtggac 


tcttgttcca 


aactggaaca acactcaacc 


ctatctcggt 


ctattctttt 


2940 


gatttataag 


ggattttgcc 


gatttcggcc tattggttaa 


aaaatgagct 


gatttaacaa 


3000 


aaatttaacg 


cgaattttaa 


caaaatatta acgtttacaa 


tttcccattc 


gccattcagg 


3060 


ctgcgcaact 


gttgggaagg 


gcgatcggtg cgggcctctt 


cgctattacg 


ccagcccaag 


3120 


ctaccatgat 


aagtaagtaa 


tattaaggta cgggaggtac 


ttggagcggc 

99 9 


cgcaataaaa 


3180 


tatctttatt 


ttcattacat 


ctgtgtgttg gttttttgtg 


tgaatcgata 


gtactaacat 


3240 


acgctctcca 


tcaaaacaaa 


acgaaacaaa acaaactagc 


aaaataggct 


gtccccagtg 


3300 


caagtgcagg 


tgccagaaca 


tttctctatc gataggtacc 


gagctcttac 


gcgtgctagc 


3360 


cctcgagcag 


gatctataca 


1:1:0331:0331: sttaacaa'tt 


Cl^^V«Cl LCI U CCL 


n1" r* a "f* 1" n n 1" "t" 




atatagcata 


aatcaatatt 


ggctattggc cattgcatac 


gttgtatcta 


tatcataata 


3480 


tgtacattta 


tattggctca 


tgtccaatat gaccgccatg 


ttgacattga 


ttattgacta 


3540 


gttattaata 


gtaatcaatt 


acggggtcat tagttcatag 


cccatatatg 


gagttccgcg 


3600 


ttacataact 


tacggtaaat 


ggcccgcctg gctgaccgcc 


caacgacccc 


cgcccattga 


3660 
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cgtcaataat gacgtatgtt cccatagtaa cgccaatagg gactttccat tgacgtcaat 3720 

gggtggagta tttacggtaa actgcccact tggcagtaca tcaagtgtat catatgccaa 3780 

gtccgccccc tattgacgtc aatgacggta aatggcccgc ctggcattat gcccagtaca 3840 

tgaccttacg ggactttcct acttggcagt acatctacgt attagtcatc gctattacca 3900 

tggtgatgcg gttttggcag tacatcaatg ggcgtggata gcggtttgac tcacggggat 3960 

ttccaagtct ccaccccatt gacgtcaatg ggagtttgtt ttggcaccaa aatcaacggg 4020 

actttccaaa atgtcgtaac aactccgccc cattgacgca aatgggcggt aggcgtgtac 4080 

ggtgggaggt ctatataagc agagctcgtt tagtgaaccg tcagatcgcc tggagacgcc 4140 

atccacgctg ttttgacctc catagaagac accgggaccg atccagcctc ccctcgaagc 4200 

tcgactctag gggctcgaga tctgcgatct aagtaagctt gcatgcctgc aggtcggccg 4260 

ccacgaccgg tgccgccacc atcccctgac ccacgcccct gacccctcac aaggagacga 4320 

ccttccatga ccgagtacaa gcccacggtg cgcctcgcca cccgcgacga cgtcccccgg 4380 

gccgtacgca ccctcgccgc cgcgttcgcc gactaccccg ccacgcgcca caccgtcgac 4440 

ccggaccgcc acatcgagcg ggtcaccgag ctgcaagaac tcttcctcac gcgcgtcggg 4500 

ctcgacatcg gcaaggtgtg ggtcgcggac gacggcgccg cggtggcggt ctggaccacg 4560 

ccggagagcg tcgaagcggg ggcggtgttc gccgagatcg gcccgcgcat ggccgagttg 4620 

agcggttccc ggctggccgc gcagcaacag atggaaggcc tcctggcgcc gcaccggccc 4680 

aaggagcccg cgtggttcct ggccaccgtc ggcgtctcgc ccgaccacca gggcaagggt 4740 

ctgggcagcg ccgtcgtgct ccccggagtg gaggcggccg agcgcgccgg ggtgcccgcc 4800 

ttcctggaga cctccgcgcc ccgcaacctc cccttctacg agcggctcgg cttcaccgtc 4860 

accgccgacg tcgaggtgcc cgaaggaccg cgcacctggt gcatgacccg caagcccggt 4920 

gcctgacgcc cgccccacga cccgcagcgc ccgaccgaaa ggagcgcacg accccatggc 4980 

tccgaccgaa gccgacccgg gcggccccgc cgaccccgca cccgcccccg aggcccaccg 5040 

5043 

<210> 6 
<211> 5041 
<212> DNA 

<213> Plasmid pCMV-EGFP-attB 
<400> 6 

ctagagtcgg ggcggccggc cgcttcgagc agacatgata agatacattg atgagtttgg 60 

acaaaccaca actagaatgc agtgaaaaaa atgctttatt tgtgaaattt gtgatgctat 120 

tgctttattt gtaaccatta taagctgcaa taaacaagtt aacaacaaca attgcattca 180 

ttttatgttt caggttcagg gggaggtgtg ggaggttttt taaagcaagt aaaacctcta 240 
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caaatgtggt aaaatcgata aggatcaatt cggcttcagg taccgtcgac gatgtaggtc 300 

acggtctcga agccgcggtg cgggtgccag ggcgtgccct tgggctcccc gggcgcgtac 360 

tccacctcac ccatctggtc catcatgatg aacgggtcga ggtggcggta gttgatcccg 420 

gcgaacgcgc ggcgcaccgg gaagccctcg ccctcgaaac cgctgggcgc ggtggtcacg 480 

gtgagcacgg gacgtgcgac ggcgtcggcg ggtgcggata cgcggggcag cgtcagcggg 540 

ttctcgacgg tcacggcggg catgtcgaca gccgaattga tccgtcgacc gatgcccttg 600 

agagccttca acccagtcag ctccttccgg tgggcgcggg gcatgactat cgtcgccgca 660 

cttatgactg tcttctttat catgcaactc gtaggacagg tgccggcagc gctcttccgc 720 

ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca 780 

ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg 840 

agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca 900 

taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa 960 

cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc 1020 

tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc 1080 

gctttctcaa tgctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct 1140 

gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg 1200 

tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag 1260 

gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta 1320 

cggctacact agaaggacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg 1380 

aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt 1440 

tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt 1500 

ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag 1560 

attatcaaaa aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat 1620 

ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc 1680 

tatctcagcg atctgtctat ttcgttcatc catagttgcc tgactccccg tcgtgtagat 1740 

aactacgata cgggagggct taccatctgg ccccagtgct gcaatgatac cgcgagaccc 1800 

acgctcaccg gctccagatt tatcagcaat aaaccagcca gccggaaggg ccgagcgcag 1860 

aagtggtcct gcaactttat ccgcctccat ccagtctatt aattgttgcc gggaagctag 1920 

agtaagtagt tcgccagtta atagtttgcg caacgttgtt gccattgcta caggcatcgt 1980 

ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc ggttcccaac gatcaaggcg 2040 

agttacatga tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt 2100 
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tgtcagaagt aagttggccg cagtgttatc actcatggtt atggcagcac tgcataattc 2160 

tcttactgtc atgccatccg taagatgctt ttctgtgact ggtgagtact caaccaagtc 2220 

attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa 2280 

taccgcgcca catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg 2340 

aaaactctca aggatcttac cgctgttgag atccagttcg atgtaaccca ctcgtgcacc 2400 

caactgatct tcagcatctt ttactttcac cagcgtttct gggtgagcaa aaacaggaag 2460 

gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa tgttgaatac tcatactctt 2520 

cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg gatacatatt 2580 

tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc 2640 

acctgacgcg ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt 2700 

gaccgctaca cttgccagcg ccctagcgcc cgctcctttc gctttcttcc cttcctttct 2760 

cgccacgttc gccggctttc cccgtcaagc tctaaatcgg gggctccctt tagggttccg 2820 

atttagtgct ttacggcacc tcgaccccaa aaaacttgat tagggtgatg gttcacgtag 2880 

tgggccatcg ccctgataga cggtttttcg ccctttgacg ttggagtcca cgttctttaa 2940 

tagtggactc ttgttccaaa ctggaacaac actcaaccct atctcggtct attcttttga 3000 

tttataaggg attttgccga tttcggccta ttggttaaaa aatgagctga tttaacaaaa 3060 

atttaacgcg aattttaaca aaatattaac gtttacaatt tcccattcgc cattcaggct 3120 

gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg ctattacgcc agcccaagct 3180 

accatgataa gtaagtaata ttaaggtacg ggaggtactt ggagcggccg caataaaata 3240 

tctttatttt cattacatct gtgtgttggt tttttgtgtg aatcgatagt actaacatac 3300 

gctctccatc aaaacaaaac gaaacaaaac aaactagcaa aataggctgt ccccagtgca 3360 

agtgcaggtg ccagaacatt tctctatcga taggtaccga gctcttacgc gtgctagccc 3420 

tcgagcagga tctatacatt gaatcaatat tggcaattag ccatattagt cattggttat 3480 

atagcataaa tcaatattgg ctattggcca ttgcatacgt tgtatctata tcataatatg 3540 

tacatttata ttggctcatg tccaatatga ccgccatgtt gacattgatt attgactagt 3600 

tattaatagt aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt 3660 

acataactta cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg 3720 

tcaataatga cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg 3780 

gtggagtatt tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt 3840 

ccgcccccta ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg 3900 

accttacggg actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg 3960 

gtgatgcggt tttggcagta catcaatggg cgtggatagc ggtttgactc acggggattt 4020 
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ccaagtctcc 


accccattga cgtcaatggg agtttgtttt 


ggcaccaaaa 


tcaacgggac 


4080 


tttccaaaat 


gtcgtaacaa ctccgcccca ttgacgcaaa 


tgggcggtag 


gcqtqtacqq 


4140 


tgggaggtct 


atataagcag agctcgttta gtgaaccgtc 


agatcgcctg 


gagacgccat 


4200 


ccacgctgtt 


ttgacctcca tagaagacac cgggaccgat 


ccagcctccc 


ctcgaagctc 


4260 


gactctaggg 


gctcgagatc cccgggtacc ggtcgccacc 


<it^ggtgagca 


agqqcqaqaa 


4320 


gctgttcacc 


ggggtggtgc ccatcctggt cgagctggac 


ggcgacgtaa 


acqqccacaa 


4380 


gttcagcgtg 


tccggcgagg gcgagggcga tgccacctac 


ggcaagctga 


ccctqaaatt 


4440 


catctgcacc 


accggcaagc tgcccgtgcc ctggcccacc 


ctcgtgacca 


ccctaaccta 




cggcgtgcag 


tgcttcagcc gctaccccga ccacatgaag 


cagcacgact 


tcttcaagtc 


4560 


cgccatgccc 


gaaggctacg tccaggagcg caccatcttc 


ttcaaggacg 


acQQcaacta 


4620 


caagacccgc 


gccgaggtga agttcgaggg cgacaccctg 


gtgaaccgca 


tcqaqctaaa 


4680 


gggcatcgac 


ttcaaggagg acggcaacat cctggggcac 


aagctggagt 


acaactacaa 


4740 


cagccacaac 


gtctatatca tggccgacaa gcagaagaac 


ggcatcaagg 


tgaacttcaa 


4800 


gatccgccac 


aacatcaaaa acaacaorai" oranri-rnrr- 


9<tCCaC LaCC 


agcagaacac 


4860 


ccccatcggc 


gacggccccg tgctgctgcc cgacaaccac 


tacctgagca 


cccagtccgc 


4920 


cctgagcaaa 


gaccccaacg agaagcgcga tcacatggtc 


ctgctggagt 


tcgtgaccgc 


4980 


cgccgggatc 


actctcggca tggacgagct gtacaagtaa 


agcggccgct 


cgagcatgca 


5040 


t 








5041 



<210> 7 
<211> 18116 
<212> DNA 

<213> Plasmid pl2.01ys-LSPlPNMM-CMV-pur-attB 
<400> 7 

gggctgcagg aattcgattg ccgccttctt tgatattcac tctgttgtat ttcatctctt 60 

cttgccgatg aaaggatata acagtctgta taacagtctg tgaggaaata cttggtattt 120 

cttctgatca gtgtttttat aagtaatgtt gaatattgga taaggctgtg tgtcctttgt 180 

cttgggagac aaagcccaca gcaggtggtg gttggggtgg tggcagctca gtgacaggag 240 

aggttttttt gcctgttttt tttttttttt ttttttttaa gtaaggtgtt cttttttctt 300 

agtaaatttt ctactggact gtatgttttg acaggtcaga aacatttctt caaaagaaga 360 

accttttgga aactgtacag cccttttctt tcattccctt tttgctttct gtgccaatgc 420 

ctttggttct gattgcatta tggaaaacgt tgatcggaac ttgaggtttt tatttatagt 480 

gtggcttgaa agcttggata gctgttgtta cacgagatac cttattaagt ttaggccagc 540 

ttgatgcttt attttttccc tttgaagtag tgagcgttct ctggtttttt tcctttgaaa 600 
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ctggtgaggc 


ttagattttt 


ctaatgggat 


tttttacctg 


atgatctagt 


tgcataccca 


660 


aatgcttgta 


aatgttttcc 


tagttaacat 


gttgataact 


tcggatttac 


atgttgtata 


720 


tacttgtcat 


ctgtgtttct 


agtaaaaata 


tatggcattt 


atagaaatac 


gtaattcctg 


780 


atttcctttt 


tttttatctc 


tatgctctgt 


gtgtacaggt 


caaacagact 


tcactcctat 


840 


ttttatttat 


agaattttat 


atgcagtctg 


tcgttggttc 


ttgtgttgta 


aggatacagc 


900 


cttaaatttc 


ctagagcgat 


gctcagtaag 


gcgggttgtc 


acatgggttc 


aaatgtaaaa 


960 


cgggcacgtt 


tggctgctgc 


cttcccgaga 


tccaggacac 


taaactgctt 


ctgcactgag 


1020 


gtataaatcg 


cttcagatcc 


cagggaagtg 


cagatccacg 


tgcatattct 


taaagaagaa 


1080 


tgaatacttt 


ctaaaatatt 


ttggcatagg 


aagcaagctg 


catggatttg 


tttgggactt 


1140 


aaattatttt 


ggtaacggag 


tgcataggtt 


ttaaacacag 


ttgcagcatg 


ctaacgagtc 


1200 


acagcgttta tgcagaagtg 


atgcctggat 


gcctgttgca gctgtttacg 


gcactgcctt 


1260 


gcagtgagca ttgcagatag 


gggtggggtg 


ctttgtgtcg 


tgttcccaca 


cgctgccaca 


1320 


cagccacctc 


ccggaacaca tctcacctgc 


tgggtacttt 


tcaaaccatc 


ttagcagtag 


1380 


tagatgagtt actatgaaac agagaagttc ctcagttgga tattctcatg 


ggatgtcttt 


1440 


tttcccatgt 


tgggcaaagt 


atgataaagc 


atctctattt 


gtaaattatg 


cacttgttag 


1500 


ttcctgaatc 


ctttctatag 


caccacttat 


tgcagcaggt 


gtaggctctg 


gtgtggcctg 


1560 


tgtctgtgct 


tcaatctttt 


aaagcttctt 


tggaaataca 


ctgacttgat 


tgaagtctct 


1620 


tgaagatagt 


aaacagtact 


tacctttgat 


cccaatgaaa 


tcgagcattt 


cagttgtaaa 


1680 


agaattccgc 


ctattcatac 


catgtaatgt 


aattttacac 


ccccagtgct 


gacactttgg 


1740 


aatatattca 


agtaatagac 


tttggcctca 


ccctcttgtg 


tactgtattt 


tgtaatagaa 


1800 


aatattttaa 


actgtgcata 


tgattattac 


attatgaaag 


agacattctg 


ctgatcttca 


1860 


aatgtaagaa aatgaggagt gcgtgtgctt ttataaatac aagtgattgc 


aaattagtgc 


1920 


aggtgtcctt 


aaaaaaaaaa 


aaaaaaagta 


atataaaaag 


gaccaggtgt 


tttacaagtg 


1980 


aaatacattc 


ctatttggta 


aacagttaca 


tttttatgaa 


gattaccagc 


gctgctgact 


2040 


ttctaaacat 


aaggctgtat 


tgtcttcctg 


taccattgca 


tttcctcatt 


cccaatttgc 


2100 


acaaggatgt 


ctgggtaaac 


tattcaagaa 


atggctttga 


aatacagcat 


gggagcttgt 


2160 


ctgagttgga 


atgcagagtt 


gcactgcaaa 


atgtcaggaa atggatgtct 


ctcagaatgc 


2220 


ccaactccaa 


aggattttat 


atgtgtatat 


agtaagcagt 


ttcctgattc 


cagcaggcca 


2280 


aagagtctgc 


tgaatgttgt 


gttgccggag 


acctgtattt 


ctcaacaagg 


taagatggta 


2340 


tcctagcaac 


tgcggatttt 


aatacatttt 


cagcagaagt 


acttagttaa 


tctctacctt 


2400 


tagggatcgt 


ttcatcattt 


ttagatgtta 


tacttgaaat 


actgcataac 


ttttagcttt 


2460 
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aatttgctgt 


ccaacttttg 


2520 


tgttggtctt 


aaactgcaat 


agtagtttac 


cttgtattga 


agaaataaag 


accattttta 


2580 


tattaaaaaa 


tacttttgtc 


tgtcttcatt 


ttgacttgtc 


tgatatcctt 


gcagtgccca 


2640 


ttatgtcagt 


tctgtcagat 


attcagacat 


caaaacttaa 


cgtgagctca 


gtggagttac 


2700 


agctgcggtt 


ttgatgctgt 


tattatttct 


gaaactagaa 


atgatgttgt 


cttcatctgc 


2760 


tcatcaaaca 


cttcatgcag 


agtgtaaggc 


tagtgagaaa 


tgcatacatt 


tattgatact 


2820 


tttttaaagt 


caacttttta 


tcagattttt 


ttttcatttg 


gaaatatatt 


gttttctaga 


2880 


ctgcatagct 


tctgaatctg 


aaatgcagtc 


tgattggcat 


gaagaagcac 


agcactcttc 


2940 


atcttactta 


aacttcattt 


tggaatgaag 


gaagttaagc 


aagggcacag 


gtccatgaaa 


3000 


tagagacagt 


gcgctcagga 


gaaagtgaac 


ctggatttct 


ttggctagtg 


ttctaaatct 


3060 


gtagtgagga 


aagtaacacc 


cgattccttg 


aaagggctcc 


agctttaatg 


cttccaaatt 


3120 


gaaggtggca 


ggcaacttgg 


ccactggtta 


tttactgcat 


tatgtctcag 


tttcgcagct 


3180 


aacctggctt 


ctccactatt 


gagcatggac 


tatagcctgg 


cttcagaggc 


caggtgaagg 


3240 


ttgggatggg 


tggaaggagt 


gctgggctgt 


ggctgggggg 


actgtgggga 


ctccaagctg 


3300 


agcttggggt 


gggcagcaca 


gggaaaagtg 


tgggtaacta 


tttttaagta 


ctgtgttgca 


3360 


aacgtctcat 


ctgcaaatac 


gtagggtgtg 


tactctcgaa 


gattaacagt 


gtgggttcag 


3420 


taatatatgg 


atgaattcac 


agtggaagca 


ttcaagggta 


gatcatctaa 


cgacaccaga 


3480 


tcatcaagct 


atgattggaa 


gcggtatcag 


aagagcgagg 


aaggtaagca 


gtcttcatat 


3540 


gttttccctc 


cacgtaaagc 


agtctgggaa 


agtagcaccc 


cttgagcaga 


gacaaggaaa 


3600 


taattcagga 


gcatgtgcta 


ggagaacttt 


cttgctgaat 


tctacttgca 


agagctttga 


3660 


tgcctggctt 


ctggtgcctt 


ctgcagcacc 


tgcaaggccc 


agagcctgtg 


gtgagctgga 


3720 


gggaaagatt 


ctgctcaagt 


ccaagcttca 


gcaggtcatt 


gtctttgctt 


cttcccccag 


3780 


cactgtgcag 


cagagtggaa 


ctgatgtcga 


agcctcctgt 


ccactacctg 


ttgctgcagg 


3840 


cagactgctc 


tcagaaaaag 


agagctaact 


ctatgccata 


gtctgaaggt 


aaaatgggtt 


3900 


ttaaaaaaga 


aaacacaaag 


gcaaaaccgg 


■ ctgccccatg 


agaagaaagc 


agtggtaaac 


3960 


atggtagaaa 


aggtgcagaa 


gcccccaggc 


agtgtgacag 


gcccctcctg 


ccacctagag 


4020 


gcgggaacaa 


gcttccctgc 


ctagggctct 


gcccgcgaag 


tgcgtgtttc 


tttggtgggt 


4080 


tttgtttggc 


gtttggtttt 


gagatttaga 


cacaagggaa 


gcctgaaagg 


aggtgttggg 


4140 


cactattttg 


gtttgtaaag 


cctgtacttc 


aaatatatat 


tttgtgaggg 


agtgtagcga 


4200 


attggccaat 


ttaaaataaa 


gttgcaagag 


attgaaggct 


gagtagttga 


gagggtaaca 


4260 


cgtttaatga 


gatcttctga 


aactactgct 


tctaaacact 


tgtttgagtg 


gtgagacctt 


4320 


ggataggtga 


gtgctcttgt 


tacatgtctg 


atgcacttgc 
Page 


ttgtcctttt 
21 


ccatccacat 


4380 
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ccatgcattc 


cacatccacg 


catttgtcac 


ttatcccata tctgtcatat 


ctgacatacc 


4440 


tgtctcttcg 


tcacttggtc 


agaagaaaca gatgtgataa tccccagccg 


ccccaagttt 


4500 


gagaagatgg 


cagttgcttc 


tttccctttt 


tcctgctaag 


taaggatttt 


ctcctggctt 


4560 


tgacacctca 


cgaaatagtc 


ttcctgcctt 


acattctggg 


cattatttca 


aatatctttg 


4620 


gagtgcgctg 


ctctcaagtt 


tgtgtcttcc 


tactcttaga 


gtgaatgctc 


ttagagtgaa 


4680 


agagaaggaa 


gagaagatgt 


tggccgcagt 


tctctgatga 


acacacctct 


gaataatggc 


4740 


caaaggtggg 


tgggtttctc 


tgaggaacgg 


gcagcgtttg 


cctctgaaag 


caaggagctc 


4800 


tgcggagttg 


cagttatttt 


gcaactgatg 


gtggaactgg 


tgcttaaagc 


agattcccta 


4860 


ggttccctgc 


tacttctttt 


ccttcttggc 


agtcagttta 


tttctgacag 


acaaacagcc 


4920 


acccccactg 


caggcttaga 


aagtatgtgg 


ctctgcctgg 


gtgtgttaca 


gctctgccct 


4980 


ggtgaaaggg 


gattaaaacg 


ggcaccattc 


atcccaaaca 


ggatcctcat 


tcatggatca 


5040 


agctgtaagg 


aacttgggct 


ccaacctcaa 


aacattaatt 


ggagtacgaa 


tgtaattaaa 


5100 


actgcattct 


cgcattccta agtcatttag tctggactct gcagcatgta 


ggtcggcagc 


5160 


tcccactttc 


tcaaagacca 


ctgatggagg 


agtagtaaaa atggagaccg 


attcagaaca 


5220 


accaacggag 


tgttgccgaa 


gaaactgatg 


gaaataatgc atgaattgtg 


tggtggacat 


5280 


tttttttaaa 


tacataaact 


acttcaaatg 


aggtcggaga aggtcagtgt 


tttattagca 


5340 


gccataaaac 


caggtgagcg 


agtaccattt 


ttctctacaa 


gaaaaacgat 


tctgagctct 


5400 


gcgtaagtat aagttctcca tagcggctga agctcccccc tggctgcctg 


ccatctcagc 


5460 


tggagtgcag tgccatttcc ttggggtttc tctcacagca gtaatgggac 


aatacttcac 


5520 


aaaaattctt 


tcttttcctg tcatgtggga tccctactgt gccctcctgg 


ttttacgtta 


5580 


ccccctgact gttccattca gcggtttgga aagagaaaaa gaatttggaa 


ataaaacatg 


5640 


tctacgttat 


cacctcctcc 


agcattttgg 


tttttaatta 


tgtcaataac 


tggcttagat 


5700 


ttggaaatga 


gagggggttg 


ggtgtattac 


cgaggaacaa aggaaggctt 


atataaactc 


5760 


aagtctttta 


tttagagaac 


tggcaagctg 


tcaaaaacaa 


aaaggcctta 


ccaccaaatt 


5820 


aagtgaatag 


ccgctatagc 


cagcagggcc 


agcacgaggg 


atggtgcact 


gctggcacta 


5880 


tgccacggcc 


tgcttgtgac 


tctgagagca 


actgctttgg 


aaatgacagc 


acttggtgca 


5940 


atttcctttg 


tttcagaatg 


cgtagagcgt 


gtgcttggcg 


acagtttttc 


tagttaggcc 


6000 


acttcttttt 


tccttctctc 


ctcattctcc 


taagcatgtc 


tccatgctgg 


taatcccagt 


6060 


caagtgaacg 


ttcaaacaat 


gaatccatca 


ctgtaggatt 


ctcgtggtga 


tcaaatcttt 


6120 


gtgtgaggtc 


tataaaatat 


ggaagcttat 


ttatttttcg 


ttcttccata 


tcagtcttct 


6180 


ctatgacaat 


tcacatccac 


cacagcaaat 


taaaggtgaa ggaggctggt 


gggatgaaga 


6240 
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gggtcttcta gctttacgtt 


cttccttgca aggccacagg aaaatgctga gagctgtaga 


bJUU 


atacagcctg 


gggtaagaag 


ttcagtctcc 


tgctgggaca 


gctaaccgca tcttataacc 




ccttctgaga 


ctcatcttag 


gaccaaatag 


ggtctatctg 


gggtttttgt tcctgctgtt 


b4/U 


cctcctggaa ggctatctca 


ctatttcact 


gctcccacgg 


ttacaaacca 


aagatacagc 


04oU 


ctgaattttt tctaggccac attacataaa tttgacctgg taccaatatt gttctctata 


d540 


tagttatttc 


cttccccact 


gtgtttaacc ccttaaggca ttcagaacaa ctagaatcat 


ooOu 


agaatggttt 


ggattggaag 


gggccttaaa 


catcatccat 


ttccaaccct 


ctgccatggg 


DDOO 


ctgcttgcca 


cccactggct 


caggctgccc 


agggccccat 


ccagcctggc 


cttgagcacc 


6720 


tccagggatg 


gggcacccac agcttctctg 


ggcagcctgt 


gccaacacct 


caccactctc 


6780 


tgggtaaaga attctctttt 


aacatctaat 


ctaaatctct 


tctcttttag 


tttaaagcca 


6840 


ttcctctttt 


tcccgttgct atctgtccaa gaaatgtgta ttggtctccc tcctgcttat 


6900 


aagcaggaag 


tactggaagg 


ctgcagtgag 


gtctccccac 


agccttctct 


tctccaggct 


6960 


gaacaagccc agctccttca gcctgtcttc gtaggagatc atcttagtgg ccctcctctg 


70Z0 


gacccattcc 


aacagttcca 


cggctttctt 


gtggagcccc 


aggtctggat 


gcagtacttc 


7080 


agatggggcc 


ttacaaaggc 


agagcagatg 


gggacaatcg 


cttacccctc 


cctgctggct 


/X4U 


gcccctgttt 


tgatgcagcc 


cagggtactg 


ttggcctttc 


aggctcccag 


accccttgct 




gatttgtgtc 


aagcttttca 


tccaccagaa 


cccacgcttc 


ctggttaata 


cttctgccct 


7Z60 


cacttctgta 


agcttgtttc 


aggagacttc 


cattctttag 


gacagactgt 


gttacaccta 


75Z0 


cctgccctat 


tcttgcatat 


atacatttca 


gttcatgttt 


cctgtaacag 


gacagaatat 


"7 ^ O A 

7380 


gtattcctct 


aacaaaaata 


catgcagaat 


tcctagtgcc 


atctcagtag 


ggttttcatg 


/440 


gcagtattag 


cacatagtca 


atttgctgca 


agtaccttcc 


aagctgcggc 


ctcccataaa 


7500 


tcctgtattt 


gggatcagtt 


accttttggg 


gtaagctttt 


gtatctgcag 


agaccctggg 


7560 


ggttctgatg 


tgcttcagct 


ctgctctgtt 


ctgactgcac 


cattttctag 


atcacccagt 


7620 


tgttcctgta 


caacttcctt 


gtcctccatc 


ctttcccagc 


ttgtatcttt 


gacaaataca 


7680 


ggcctatttt 


tgtgtttgct 


tcagcagcca tttaattctt 


cagtgtcatc 


ttgttctgtt 


7740 


gatgccactg 


gaacaggatt 


ttcagcagtc ttgcaaagaa 


catctagctg 


aaaactttct 


7800 


gccattcaat 


attcttacca 


gttcttcttg tttgaggtga gccataaatt actagaactt 


/ooU 


cgtcactgac 


aagtttatgc 


attttattac 


ttctattatg 


tacttacttt 


gacataacac 


7920 


agacacgcac 


atattttgct 


gggatttcca cagtgtctct 


gtgtccttca 


catggtttta 


7980 


ctgtcatact 


tccgttataa 


ccttggcaat 


ctgcccagct 


gcccatcaca 


agaaaagaga 


8040 


ttcctttttt 


attacttctc 


ttcagccaat aaacaaaatg tgagaagccc aaacaagaac 


8100 


ttgtggggca 


ggctgccatc 


aagggagaga 


cagctgaagg 


gttgtgtagc 


tcaatagaat 


8160 
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taagaaataa 


taaagctgtg 


tcagacagtt 


ttgcctgatt 


tatacaggca 


cgccccaagc 


8220 


cagagaggct 


gtctgccaag 


gccaccttgc 


agtccttggt 


ttgtaagata 


agtcataggt 


8280 


aacttttctg 


gtgaattgcg 


tggagaatca 


tgatggcagt 


tcttgctgtt 


tactatggta 


8340 


agatgctaaa 


ataggagaca 


gcaaagtaac 


acttgctgct 


gtaggtgctc 


tgctatccag 


8400 


acagcgatgg 


cactcgcaca 


ccaagatgag 


ggatgctccc 


agctgacgga 


tgctggggca 


8460 


gtaacagtgg 


gtcccatgct 


gcctgctcat 


tagcatcacc 


tcagccctca 


ccagcccatc 


8520 


agaaggatca 


tcccaagctg 


aggaaagttg 


ctcatcttct 


tcacatcatc 


aaacctttgg 


8580 


cctgactgat 


gcctcccgga 


tgcttaaatg 


tggtcactga 


catctttatt 


tttctatgat 


8640 


ttcaagtcag 


aacctccgga 


tcaggaggga 


acacatagtg 


ggaatgtacc 


ctcagctcca 


8700 


aggccagatc 


ttccttcaat 


gatcatgcat 


gctacttagg 


aaggtgtgtg 


tgtgtgaatg 


8760 


tagaattgcc 


tttgttattt 


tttcttcctg 


ctgtcaggaa 


cattttgaat 


accagagaaa 


8820 


aagaaaagtg 


ctcttcttgg 


catgggagga 


gttgtcacac 


ttgcaaaata 


aaggatgcag 


8880 


tcccaaatgt 


tcataatctc 


agggtctgaa 


ggaggatcag 


aaactgtgta 


tacaatttca 


8940 


ggcttctctg 


aatgcagctt 


ttgaaagctg 


ttcctggccg 


aggcagtact 


agtcagaacc 


9000 


ctcggaaaca 


ggaacaaatg 


tcttcaaggt 


gcagcaggag 


gaaacacctt 


gcccatcatg 


9060 


aaagtgaata 


accactgccg 


ctgaaggaat 


ccagctcctg 


tttgagcagg 


tgctgcacac 


9120 


tcccacactg 


aaacaacagt 


tcatttttat 


aggacttcca 


ggaaggatct 


tcttcttaag 


9180 


cttcttaatt 


atggtacatc 


tccagttggc 


agatgactat 


gactactgac 


aggagaatga 


9240 


ggaactagct 


gggaatattt 


ctgtttgacc 


accatggagt 


cacccatttc 


tttactggta 


9300 


tttggaaata 


ataattctga 


attgcaaagc 


aggagttagc 


gaagatcttc 


atttcttcca 


9360 


tgttggtgac 


agcacagttc 


tggctatgaa 


agtctgctta 


caaggaagag 


gataaaaatc 


9420 


atagggataa 


taaatctaag 


tttgaagaca 


atgaggtttt 


agctgcattt 


gacatgaaga 


9480 


aattgagacc 


tctactggat 


agctatggta 


tttacgtgtc 


tttttgctta 


gttacttatt 


9540 


gaccccagct 


gaggtcaagt 


atgaactcag 


gtctctcggg 


ctactggcat 


ggattgatta 


9600 


catacaactg 


taattttagc 


agtgatttag 


ggtttatgag 


tacttttgca 


gtaaatcata 


9660 


gggttagtaa 


tgttaatctc 


agggaaaaaa 


aaaaaaagcc 


aaccctgaca 


gacatcccag 


9720 


ctcaggtgga 


aatcaaggat 


cacagctcag 


tgcggtccca 


gagaacacag 


ggactcttct 


9780 


cttaggacct 


ttatgtacag 


ggcctcaaga 


taactgatgt 


tagtcagaag 


actttccatt 


9840 


ctggccacag 


ttcagctgag 


gcaatcctgg 


aattttctct 


ccgctgcaca 


gttccagtca 


9900 


tcccagtttg 


tacagttctg 


gcactttttg 


ggtcaggccg 


tgatccaagg 


agcagaagtt 


9960 


ccagctatgg 


tcagggagtg 


cctgaccgtc 


ccaactcact 


gcactcaaac 


aaaggcgaaa 


10020 
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ccacaagagt ggcttttgtt gaaattgcag tgtggcccag aggggctgca ccagtactgg 10080 

attgaccacg aggcaacatt aatcctcagc aagtgcaatt tgcagccatt aaattgaact 10140 

aactgatact acaatgcaat cagtatcaac aagtggtttg gcttggaaga tggagtctag 10200 

gggctctaca ggagtagcta ctctctaatg gagttgcatt ttgaagcagg acactgtgaa 10260 

aagctggcct cctaaagagg ctgctaaaca ttagggtcaa ttttccagtg cactttctga 10320 

agtgtctgca gttccccatg caaagctgcc caaacatagc acttccaatt gaatacaatt 10380 

atatgcaggc gtactgcttc ttgccagcac tgtccttctc aaatgaactc aacaaacaat 10440 

ttcaaagtct agtagaaagt aacaagcttt gaatgtcatt aaaaagtata tctgctttca 10500 

gtagttcagc ttatttatgc ccactagaaa catcttgtac aagctgaaca ctggggctcc 10560 

agattagtgg taaaacctac tttatacaat catagaatca tagaatggcc tgggttggaa 10620 

gggaccccaa ggatcatgaa gatccaacac ccccgccaca ggcagggcca ccaacctcca 10680 

gatctggtac tagaccaggc agcccagggc tccatccaac ctggccatga acacctccag 10740 

ggatggagca tccacaacct ctctgggcag cctgtgccag cacctcacca ccctctctgt 10800 

gaagaacttt tccctgacat ccaatctaag ccttccctcc ttgaggttag atccactccc 10860 

ccttgtgcta tcactgtcta ctcttgtaaa aagttgattc tcctcctttt tggaaggttg 10920 

caatgaggtc tccttgcagc cttcttctct tctgcaggat gaacaagccc agctccctca 10980 

gcctgtcttt ataggagagg tgctccagcc ctctgatcat ctttgtggcc ctcctctgga 11040 

cccgctccaa gagctccaca tctttcctgt actgggggcc ccaggcctga atgcagtact 11100 

ccagatgggg cctcaaaaga gcagagtaaa gagggacaat caccttcctc accctgctgg 11160 

ccagccctct tctgatggag ccctggatac aactggcttt ctgagctgca acttctcctt 11220 

atcagttcca ctattaaaac aggaacaata caacaggtgc tgatggccag tgcagagttt 11280 

ttcacacttc ttcatttcgg tagatcttag atgaggaacg ttgaagttgt gcttctgcgt 11340 

gtgcttcttc ctcctcaaat actcctgect gatacctcac cccacctgcc actgaatggc 11400 

tccatggccc cctgcagcca gggccctgat gaacccggca ctgcttcaga tgctgtttaa 11460 

tagcacagta tgaccaagtt gcacctatga atacacaaac aatgtgttgc atccttcagc 11520 

acttgagaag aagagccaaa tttgcattgt caggaaatgg tttagtaatt ctgccaatta 11580 

aaacttgttt atctaccatg gctgttttta tggctgttag tagtggtaca ctgatgatga 11640 

acaatggcta tgcagtaaaa tcaagactgt agatattgca acagactata aaattcctct 11700 

gtggcttagc caatgtggta cttcccacat tgtataagaa atttggcaag tttagagcaa 11760 

tgtttgaagt gttgggaaat ttctgtatac tcaagagggc gtttttgaca actgtagaac 11820 

agaggaatca aaagggggtg ggaggaagtt aaaagaagag gcaggtgcaa gagagcttgc 11880 

agtcccgctg tgtgtacgac actggcaaca tgaggtcttt gctaatcttg gtgctttgct 11940 
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tcctgcccct ggctgcctta gggtgcgatc tgcctcagac ccacagcctg ggcagcagga 12000 

ggaccctgat gctgctggct cagatgagga gaatcagcct gtttagctgc ctgaaggata 12060 

ggcacgattt tggctttcct caagaggagt ttggcaacca gtttcagaag gctgagacca 12120 

tccctgtgct gcacgagatg atccagcaga tctttaacct gtttagcacc aaggatagca 12180 

gcgctgcttg ggatgagacc ctgctggata agttttacac cgagctgtac cagcagctga 12240 

acgatctgga ggcttgcgtg atccagggcg tgggcgtgac cgagacccct ctgatgaagg 12300 

aggatagcat cctggctgtg aggaagtact ttcagaggat caccctgtac ctgaaggaga 12360 

agaagtacag cccctgcgct tgggaagtcg tgagggctga gatcatgagg agctttagcc 12420 

tgagcaccaa cctgcaagag agcttgaggt ctaaggagta aaaagtctag agtcggggcg 12480 

gccggccgct tcgagcagac atgataagat acattgatga gtttggacaa accacaacta 12540 

gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga tgctattgct ttatttgtaa 12600 

ccattataag ctgcaataaa caagttaaca acaacaattg cattcatttt atgtttcagg 12660 

ttcaggggga ggtgtgggag gttttttaaa gcaagtaaaa cctctacaaa tgtggtaaaa 12720 

tcgataagga tccgtcgacc gatgcccttg agagccttca acccagtcag ctccttccgg 12780 

tgggcgcggg gcatgactat cgtcgccgca cttatgactg tcttctttat catgcaactc 12840 

gtaggacagg tgccggcagc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 12900 

cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 12960 

atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 13020 

taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 13080 

aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 13140 

tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 13200 

gtccgccttt ctcccttcgg gaagcgtggc gctttctcaa tgctcacgct gtaggtatct 13260 

cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 13320 

cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 13380 

atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 13440 

tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat 13500 

ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 13560 

acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 13620 

aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 13680 

aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 13740 

tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 13800 
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cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 13860 

catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 13920 

ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat 13980 

aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat 14040 

ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg 14100 

caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 14160 

attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 14220 

agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 14280 

actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 14340 

ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 14400 

ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt 14460 

gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 14520 

atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 14580 

cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 14640 

gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 14700 

gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 14760 

ggttccgcgc acatttcccc gaaaagtgcc acctgacgcg ccctgtagcg gcgcattaag 14820 

cgcggcgggt gtggtggtta cgcgcagcgt gaccgctaca cttgccagcg ccctagcgcc 14880 

cgctcctttc gctttcttcc cttcctttct cgccacgttc gccggctttc cccgtcaagc 14940 

tctaaatcgg gggctccctt tagggttccg atttagtgct ttacggcacc tcgaccccaa 15000 

aaaacttgat tagggtgatg gttcacgtag tgggccatcg ccctgataga cggtttttcg 15060 

ccctttgacg ttggagtcca cgttctttaa tagtggactc ttgttccaaa ctggaacaac 15120 

actcaaccct atctcggtct attcttttga tttataaggg attttgccga tttcggccta 15180 

ttggttaaaa aatgagctga tttaacaaaa atttaacgcg aattttaaca aaatattaac 15240 

gtttacaatt tcccattcgc cattcaggct gcgcaactgt tgggaagggc gatcggtgcg 15300 

ggcctcttcg ctattacgcc agcccaagct accatgataa gtaagtaata ttaaggtacg 15360 

ggaggtactt ggagcggccg ctctagaact agtggatccc ccggccgcaa taaaatatct 15420 

ttattttcat tacatctgtg tgttggtttt ttgtgtgaat cgatagtact aacatacgct 15480 

ctccatcaaa acaaaacgaa acaaaacaaa ctagcaaaat aggctgtccc cagtgcaagt 15540 

gcaggtgcca gaacatttct ctatcgatag gtaccgagct cttacgcgtg ctagccctcg 15600 

agcaggatct atacattgaa tcaatattgg caattagcca tattagtcat tggttatata 15660 

gcataaatca atattggcta ttggccattg catacgttgt atctatatca taatatgtac 15720 
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atttatattg gctcatgtcc aatatgaccg ccatgttgac attgattatt gactagttat 15780 

taatagtaat caattacggg gtcattagtt catagcccat atatggagtt ccgcgttaca 15840 

taacttacgg taaatggccc gcctggctga ccgcccaacg acccccgccc attgacgtca 15900 

ataatgacgt atgttcccat agtaacgcca atagggactt tccattgacg tcaatgggtg 15960 

gagtatttac ggtaaactgc ccacttggca gtacatcaag tgtatcatat gccaagtccg 16020 

ccccctattg acgtcaatga cggtaaatgg cccgcctggc attatgccca gtacatgacc 16080 

ttacgggact ttcctacttg gcagtacatc tacgtattag tcatcgctat taccatggtg 16140 

atgcggtttt ggcagtacat caatgggcgt ggatagcggt ttgactcacg gggatttcca 16200 

agtctccacc ccattgacgt caatgggagt ttgttttggc accaaaatca acgggacttt 16260 

ccaaaatgtc gtaacaactc cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg 16320 

gaggtctata taagcagagc tcgtttagtg aaccgtcaga tcgcctggag acgccatcca 16380 

cgctgttttg acctccatag aagacaccgg gaccgatcca gcctcccctc gaagctcgac 16440 

tctaggggct cgagatctgc gatctaagta agcttgcatg cctgcaggtc ggccgccacg 16500 

accggtgccg ccaccatccc ctgacccacg cccctgaccc ctcacaagga gacgaccttc 16560 

catgaccgag tacaagccca cggtgcgcct cgccacccgc gacgacgtcc cccgggccgt 16620 

acgcaccctc gccgccgcgt tcgccgacta ccccgccacg cgccacaccg tcgacccgga 16680 

ccgccacatc gagcgggtca ccgagctgca agaactcttc ctcacgcgcg tcgggctcga 16740 

catcggcaag gtgtgggtcg cggacgacgg cgccgcggtg gcggtctgga ccacgccgga 16800 

gagcgtcgaa gcgggggcgg tgttcgccga gatcggcccg cgcatggccg agttgagcgg 16860 

ttcccggctg gccgcgcagc aacagatgga aggcctcctg gcgccgcacc ggcccaagga 16920 

gcccgcgtgg ttcctggcca ccgtcggcgt ctcgcccgac caccagggca agggtctggg 16980 

cagcgccgtc gtgctccccg gagtggaggc ggccgagcgc gccggggtgc ccgccttcct 17040 

ggagacctcc gcgccccgca acctcccctt ctacgagcgg ctcggcttca ccgtcaccgc 17100 

cgacgtcgag gtgcccgaag gaccgcgcac ctggtgcatg acccgcaagc ccggtgcctg 17160 

acgcccgccc cacgacccgc agcgcccgac cgaaaggagc gcacgacccc atggctccga 17220 

ccgaagccga cccgggcggc cccgccgacc ccgcacccgc ccccgaggcc caccgactct 17280 

agagtcgggg cggccggccg cttcgagcag acatgataag atacattgat gagtttggac 17340 

aaaccacaac tagaatgcag tgaaaaaaat gctttatttg tgaaatttgt gatgctattg 17400 

ctttatttgt aaccattata agctgcaata aacaagttaa caacaacaat tgcattcatt 17460 

ttatgtttca ggttcagggg gaggtgtggg aggtttttta aagcaagtaa aacctctaca 17520 

aatgtggtaa aatcgataag gatcaattcg gcttcaggta ccgtcgacga tgtaggtcac 17580 
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ggtctcgaag 


ccgcggtgcg 


ggtgccaggg 


025CIP SEQ 
cgtgcccttg 


List.txt 
ggctccccgg 


gcgcgtactc 


17640 


cacctcaccc 


atctggtcca 


tcatgatgaa cgggtcgagg tggcggtagt 


tgatcccggc 


17700 


gaacgcgcgg 


cgcaccggga 


agccctcgcc 


ctcgaaaccg 


ctgggcgcgg 


tggtcacggt 


17760 


gagcacggga 


cgtgcgacgg 


cgtcggcggg 


tgcggatacg 


cggggcagcg 


tcagcgggtt 


17820 


ctcgacggtc 


acggcgggca 


tgtcgacagc cgaattgatc 


cgtcgaccga 


tgcccttgag 


17880 


agccttcaac 


ccagtcagct 


ccttccggtg 


ggcgcggggc 


a *!* /I a r"*!" a "It" 
cl L^dU Ld Ll-^ 


tcgccgcact 


17940 


tatgactgtc 


ttctftatca 


tgcaactcgt aggacaggtg 


ccggcagcgc 


tcttccgctt 


18000 


cctcgctcac 


tgactcgctg 


cgctcggtcg ttcggctgcg 


gcgagcggta 


tcagctcact 


18060 


caaaggcggt 


aatacggtta 


tccacagaat 


caggggataa 


cgcaggaaag 


aacatg 


18116 


<210> 8 
<211> 17402 
<212> DNA 

<213> Plasmid pOMlFN- 


-ins-CMV-pur-attB 








<400> 8 
ggccgccacc 


gcggtggagc 


tccaattcgc 


cctatagtga gtcgtattac 


aattcactgg 


60 


ccgtcgtttt 


acaacgtcgt 


gactgggaaa 


accctggcgt 


tacccaact't 


aatcgccttg 


120 


CagCaCaX CC 


CCCT. uT.c.yc.c 


^ /-f /~ -f- f-t f— /~i •#- 
agcxygcyxci 


atagcgaaga ggcccgcacc 


gatcgccctt 


180 


cccaacagtt 


gcgcagcctg 


aatggcgaat 


gggacgcgcc 


ctgtagcggc 


gcattaagcg 


240 


cggcgggtgt 


ggtggttacg 


cgcagcgtga 


ccgctacact 


tgccagcgcc 


ctagcgcccg 


300 


ctcctttcgc 


tttcttccct 


tcctttctcg 


ccacgttcgc 


cggctttccc 


cgtcaagctc 


360 


taaatcgggg 


gctcccttta 


gggttccgat 


ttagtgcttt 


acggcacctc 


gaccccaaaa 


420 


aacttgatta 


gggtgatggt 


tcacgtagtg 


ggccatcgcc 


ctgatagacg 


gtttttcgcc 


480 


ctttgacgtt 


ggagtccacg 


ttctttaata 


gtggactctt 


gttccaaact 


ggaacaacac 


540 


tcaaccctat 


ctcggtctat 


tcttttgatt 


tataagggat 


tttgccgatt 


tcggcctatt 


600 


ggttaaaaaa 


tgagctgatt 


taacaaaaat 


ttaacgcgaa ttttaacaaa 


atattaacgc 


660 


ttacaattta 


ggtggcactt 


ttcggggaaa 


tgtgcgcgga 


acccctattt 


gtttattttt 


720 


ctaaatacat 


tcaaatatgt 


atccgctcat 


gagacaataa 


ccctgataaa 


tgcttcaata 


780 


atattgaaaa 


aggaagagta 


tgagtattca 


acatttccgt 


gtcgccctta 


ttcccttttt 


840 


tgcggcattt 


tgccttcctg 


tttttgctca 


cccagaaacg 


ctggtgaaag 


taaaagatgc 


900 


tgaagatcag 


ttgggtgcac 


gagtgggtta 


catcgaactg 


gatctcaaca 


gcggtaagat 


960 


ccttgagagt 


tttcgccccg 


aagaacgttt 


tccaatgatg 


agcactttta 


aagttctgct 


1020 


itgtggcgcg 


gtattatccc 


gtattgacgc 


cgggcaagag 


caactcggtc 


gccgcataca 


1080 


ztattctcag 


aatgacttgg 


ttgagtactc 


accagtcaca 


gaaaagcatc 


ttacggatgg 


1140 
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catgacagta 


aoaaaattat 


y v.a.y 


cataaccatg 


agtgataaca 


^- Ly cygcCdd 




cttacttctg 


acaacgatcg 


y y y t*- V- y c* 


ggagctaacc gcttttttgc 


d^ddv.di.yyy 


1 'PAH 


ggatcatgta 


actcgccttg 




accggagctg 


aatgaagcca 


l,.dL.^dddCyd 




cgagcgtgac 


accacaatiac 


ct'Q't'a aca a 1" 


ggcaacaacg 


ttgcgcaaac 


Xdl.T.a.dCX.yg 




caaactactt 




V- V> V. y y V^dCLVri Cl 


attaatagac tggatggagg 


cggaraaagx 


X440 


tacaaaacca 




L>yy^\M^ i. 


ggctggctgg 


tttattgctg 


ataaatcEgg 


1500 






y v*y y lci lv-ci l 


tgcagcactg 


gggccagatg 


g uaagcccnc 


1560 


ccatat catia 




^ya.v.yyyy<ty 


tcaggcaact atggatgaac 


gaaaxagaca 


loZO 


aat cactaaa 


ataaatacct 


r" a p i" n a "t" I" a a 


gcattggtaa ctgtcagacc 


aagx kXacT c 


IbtsO 


atatatactt 


taaat'taa'tl" 


1" a a a a ft" i" r~ a 


tttttaattt 


aaaaggatct 


aggt:gaagat: 


1/40 


VffVtf t> L. b k 1. 


a a 1" fi" f a "t* fi a 


ffaaaatT"*"*" 


ttaacgtgag 


ttttcgttcc 


actgagcgtc 


1800 


saa.ccccQ'ta 

^ ^ ^ ^ 23 


ciaaaannlTa 


aanna+f ^'t'f 
cidyyci i. 


ttgagatcct 


ttttttctgc 


gcgtaatctg 


■1 OCA 

1860 


ctacttacaa 


acaaaaaaap 


f apffif l"af f" 


aqcaqtaatt 


'Cqt'ttaccaa 


atcaagagct 


lyzo 


accaactctt 




I" a a ftTi n r* t"l" 


caqcaaaqca 


cagataccaa 


4* #^*ft* ^^^^ 

aXACXQX.CCX 




'tc'tag'tg'tag 


ccQ'tao't'tari 


appa fpa pt**!" 


caagaactct 


gtagcaccgc 


^'t' 9 ^ a ^ ^ 

cxaCdtacct 




cactctacta 


a t" c c t" a t" "t a p 


r* a n 1" n n r" n f* 


tgccagtggc 


gataagtcgt 


+" ^ rm 

9 XCTlLdCCyg 




gttggactca 


aaacaataat: 


■taccaaat'aa 


ggcgcagcgg 


tcgqqctqaa 


cggggggttc 




atacacacaa 


cccaocft'aci 


«. y y c* d V- y « ^ 


ctacaccgaa 


ctgagatacc 


xacagcgxga 




gctatigagaa 




^ ^^v>v.ycLCLyy 


y <*y <*d-«-y y V- y 




cggtaagcgg 




caoaatcaaa 


acaaaaa aac 


y v.a.\-ycxyyya 


gcttccaggg 


ggaaacgcct 


gguaxc X L xa 




taatrrlTiiT 


yyy <- Lv_y i-t- 


d^l^ Ly dv- L 


tgagcgtcga 


tttttgtgat 


gctcgtcagg 


2400 






cit- y \_ i_ d y i_ ci d 


cgcggccttt 


ttacggttcc 


tggccttttg 


2460 


ctaacctttt 


n r" t" (" a r* a 1" n "t" 




gttatcccct 


gattctgtgg 


ataaccgtat 


2520 


taccaccttt 


y"-y i.y<*y i-y 


a "ha 1" f fifhrri 


ccgcagccga 


acgaccgagc 


gcagcgagtc 


2580 


aatqaacaaa 


53 jj SJ 


wy v>^Vacm L 


acgcaaaccg 


cctctccccg 


cycgxLggcc 


ZD4U 


gattcattaa 


tacaactaac 




tcccgactgg 


aaagcgggca 


g Lgagcgcaa 


z/uu 


cgcaattaat 


Q'taao't'taac 


"tcac'tf"al"t*a 


ggcaccccag 


gctttacact 


X kaxycxxcc 


^/ OU 


ggcT.cgi.ai.g 


tTgrgtggaa 


ttgtgagcgg 


ataacaattt 


cacacaggaa 


acagctatga 


2820 


ccatgattac 


gccaagctcg 


aaattaaccc 


tcactaaagg 


gaacaaaagc 


tgggtaccgg 


2880 


gccccccctc 


gactagaggg 


acagcccccc 


cccaaagccc 


ccagggatgt 


aattacgtcc 


2940 


ctcccccgct 


agggggcagc 


agcgagccgc 


ccggggctcc 


gctccggtcc 


ggcgctcccc 


3000 


ccgcatcccc 


gagccggcag 


cgtgcgggga 


cagcccgggc 


acggggaagg 


tggcacggga 


3060 
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tcgctttcct ctgaacgctt ctcgctgctc tttgagcctg cagacacctg gggggatacg 3120 

gggaaaaagc tttaggctga aagagagatt tagaatgaca gaatcataga acggcctggg 3180 

ttgcaaagga gcacagtgct catccagatc caaccccctg ctatgtgcag ggtcatcaac 3240 

cagcagccca ggctgcccag agccacatcc agcctggcct tgaatgcctg cagggatggg 3300 

gcatccacag cctccttggg caacctgttc agtgcgtcac caccctctgg gggaaaaact 3360 

gcctcctcat atccaaccca aacctcccct gtctcagtgt aaagccattc ccccttgtcc 3420 

tatcaagggg gagtttgctg tgacattgtt ggtctggggt gacacatgtt tgccaattca 3480 

gtgcatcacg gagaggcaga tcttggggat aaggaagtgc aggacagcat ggacgtggga 3540 

catgcaggtg ttgagggctc tgggacactc tccaagtcac agcgttcaga acagccttaa 3600 

ggataagaag ataggataga aggacaaaga gcaagttaaa acccagcatg gagaggagca 3660 

caaaaaggcc acagacactg ctggtccctg tgtctgagcc tgcatgtttg atggtgtctg 3720 

gatgcaagca gaaggggtgg aagagcttgc ctggagagat acagctgggt cagtaggact 3780 

gggacaggca gctggagaat tgccatgtag atgttcatac aatcgtcaaa tcatgaaggc 3840 

tggaaaagcc ctccaagatc cccaagacca accccaaccc acccaccgtg cccactggcc 3900 

atgtccctca gtgccacatc cccacagttc ttcatcacct ccagggacgg tgaccccccc 3960 

acctccgtgg gcagctgtgc cactgcagca ccgctctttg gagaaggtaa atcttgctaa 4020 

atccagcccg accctcccct ggcacaacgt aaggccatta tctctcatcc aactccagga 4080 

cggagtcagt gaggatgggg ctctagtcga ggtcgacggt atcgataagc ttgattaggc 4140 

agagcaatag gactctcaac ctcgtgagta tggcagcatg ttaactctgc actggagtcc 4200 

agcgtgggaa acaatctgcc ttgcacatga gtcttcgtgg gccaatattc cccaacggtt 4260 

ttccttcagc ttgtcttgtc tcctaagctc tcaaaacacc tttttggtga ataaactcac 4320 

ttggcaacgt ttatctgtct taccttagtg tcacgtttca tccctattcc cctttctcct 4380 

cctccgtgtg gtacacagtg gtgcacactg gttcttctgt tgatgttctg ctctgacagc 4440 

caatgtgggt aaagttcttc ctgccacgtg tctgtgttgt tttcacttca aaaagggccc 4500 

tgggctcccc ttggagctct caggcatttc cttaatcatc acagtcacgc tggcaggatt 4560 

agtccctcct aaaccttaga atgacctgaa cgtgtgctcc ctctttgtag tcagtgcagg 4620 

gagacgtttg cctcaagatc agggtccatc tcacccacag ggccattccc aagatgaggt 4680 

ggatggttta ctctcacaaa aagttttctt atgtttggct agaaaggaga actcactgcc 4740 

tacctgtgaa ttcccctagt cctggttctg ctgccactgc tgcctgtgca gcctgtccca 4800 

tggagggggc agcaactgct gtcacaaagg tgatcccacc ctgtctccac tgaaatgacc 4860 

tcagtgccac gtgttgtata gggtataaag tacgggaggg ggatgcccgg ctcccttcag 4920 
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ggttgcagag 
aacttcagtc 
aatcactgca 
cctgtgattg 
ttgtttcagc 
ttgtgtctgt 
tagcgtaggt 
aacacaatta 
gagctgaaga 
agggccatag 
gtaactaaca 
tggtttatat 
ttaatctagt 
tggctttggc 
tgaagctatc 
ctgtggaaca 
taagaagaag 
tgcagcattt 
ggaaaagcct 
aatatcactc 
tgctgcagtg 
gatttctcct 
agaggaaacg 
gtcactaaga 
ggctgtaaat 
atctcctgag 
gagcgcgggg 
gtgattttct 
tcataggcca 
lagcactgaa 
:aactcctct 
jagaaagccc 



cagaagtgtc 
ctagtgtttt 
atcgcctatg 
gctttaattg 
caattcttgc 
atggatgaca 
acagtcacca 
tttggaccag 
gcaaatgttc 
cacctccatg 
atggcttatt 
aaccttcttg 
ttaactgttg 
agactttagc 
aagatcctac 
cattgtctga 
tctgtgcaat 
gccagtaggt 
gatactagaa 
tataaaagta 
ctgtttttcg 
tgtgtcatgt 
ttgcattttg 
gatctgaagg 
ggatggaagg 
ctcggtgctt 
agctttggta 
tactcacaca 
ttcagttgtc 
ggatcagttg 
catgaacaaa 
tgaatttctg 



tgtgtataga 
gtgggctgga 
ttctgaaggt 
aatggcaaat 
agccaaagag 

gggaggtagg 

cctccagctc 
tttggaaagt 
ccaacttgga 
ctattagatt 
gtgcttgtct 
ccatttgctc 
cttgattatt 
ttgggcttag 
tcaatgacat 
tgataccgaa 
tcagaggaaa 
ttcatataaa 
tttaaatatt 
gggcagccaa 
gctgatccag 
tggtgaagga 
gtggcttggg 
atggttttac 
atggacagat 
gacagactgc 
ctgttcagtc 
gggcaaaaac 
aagatgaggt 
cgagggcagg 
aagagatgca 
atatatgtgc 



025CIP SEQ 
gtgtgtctta 



List.txt 
atctattaat 



attgcccatg 
atttgggaaa 
cacaggaaag 
ccgggtgccc 
gtgacctgag 
cacacgagtc 
cacccgctga 
gatactagtt 
ccggctggca 
taagtctgtg 
ttcaggtgtt 
cttagggata 
ctcccacatt 
tagctgggtg 
ggcaaacgtg 
agccaaagtg 
actacaaaat 
cacccaggct 
ttattcacag 
ggttacttat 
gatatggcca 
agaaaggtag 
agaacagttg 
gggtggacag 
agatccatcc 
tgctgcttgt 
ctgagcagct 
ttttggtttc 
ggtttagcac 
ggtgcagatt 



tggtagggac 
gaaagggatt 
cagttctgct 
agcgatataa 
gaccaccctc 
ccatcgtggt 
attgtgaggc 
ggtattagta 
tgtacttttc 
tcctaatgta 
cttgcagaac 
agate tgaat 
agcttttgct 
caggtgtacc 
aactcaaaga 
gccattagac 
ggaataaacc 
caaggggtgt 
acaaagcttt 
tgtgggtctg 
gggggagatg 
aacgatatca 
acttggctgg 
agatttctgt 
cataaccttc 
tgcttcctgg 
tcaaagtgaa 
ttgttttgta 
tgttcagaga 
ctggcaagca 



aatgttgggc 
Page 



acctaacatt 
32 



gtaacagaac 
aggcctgcta 

tgggggattg 

caacagttgg 
tagttgtcac 
cagcttctgc 
ttaccaaaga 
tagattaata 
tcagaggaac 
aagatgattt 
aatgttcctt 
actggctgct 
aaactttttg 
gccttttctg 
aaatcctgct 
ggcacagagt 
acactttcca 
actacaaatg 
ttcatggagt 
tttttttctg 
agagctgaat 
agcatgttca 
ggtccatagt 
gtgcaggctt 
gcaggagatc 
tccagcatga 
gtgcacagtg 
caggttgctc 
aggtgggaag 
agtcttattt 
tgcagtgaag 
ccccgctgaa 



4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 
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gcacagcagc 
tgagcagtgc 
ggaatgggtg 
cgatctggtg 
ccttccaacc 
aggcaatatt 
atcgccttat 
agaatgaaga 
atgccgtctc 
agcacttttc 
acatcgctca 
ttgccttgtt 
aaactctcag 
ttttcatttc 
aatgaaatcc 
tagcaaagga 
aatataatcc 
ttctcttgtt 
ggctgcacgc 
gccctcagct 
ggtgcagccc 
gtgtttcttg 
aatgtttgga 
gggccattgc 
cacttacatc 
tgccttccaa 
cctcttcctc 
agaaaacaaa 
ttgtcgttgc 
tcgggtgaac 
tcaccctctg 



tccagctcca 
tgggactcac 
cccatccttg 
gtgagtagcc 
taagccatcc 
ttaaagttat 
caatgctttt 
gaatcattta 
tttgtatcga 
catctgatgt 
gactctttac 
ttagctcttc 
ctgctgagct 
acaggaatgt 
agaaataatt 
aataatttgg 
tggtacccac 
gttatcattt 
tctgcactga 
ggcacccagc 
tacaagctta 
ctgcccggaa 
aacggactgc 
tgaggacggc 
agtggatgtg 
gtgccagccc 
cttctccctt 
ataactggag 
tataggtctg 
tctccttcac 
ggaaaatggg 
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tgcagtactc acagctggtg cagccctcgg 
gaggttccat gtctttcaca ctgataatgg 
gaggtcccca aggccaggct ggctgcgtct 
agcccatggc aggagttaga gcctgatggt 
tacgattcta ggaatcatga cttgtgagtg 
aaatgttttc tccccttcct tgtttgtcaa 
ggagtctcca gtcatttttc ttacamcaaa 
atttcttgat tgaatagtag gattcagaaa 
gctgtaaggt ttctcatcat ttatcagcgt 
ggaaaaaaaa atccttatca tctacagtct 
caaaaaagct ataggtttta aaactacatc 
ttccatatgc tgcgtttgtg agaggtgcgt 
tgatgggtgc ttaagaatga agcactcact 
tttagtggca ttgtttttat aactacatat 
atgcaaactc actgcatccg ttgcacaggt 
ggatggcaaa aacattcctt cagacatcta 
ccacttcatc cctcattatg ttcacactca 
gatagcgttt tctttggttc tttgccacgc 
tcagcaagta gatgcgaggg aagcagcagt 
cgctcagcct aggaggggac cttgcctttc 
cacgtgctgc gagcaggtga gcaaagggag 
gcaaaacttt actttcattc attccccttg 
tttacgttca' atttctctct tccctttaag 
atcggggccc cctggaccaa atctgtggca 
ggatctgcgc ctgtaatgtg tccttctgaa 
cacagccccc agcccctccc tgtgctgctc 
tgctgtttgt gctcgggtag aaatcatgaa 
tggaaaccca ggtgatgcag ttcattcagc 
tatcagagat gctarcacca ctttgctgtc 
tcgcatcatt tgcgggcctt atttacatcc 
cgcactggat ctctaatgga agactttccc 
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ctccagggtc 
tccaatttct 
ccgagcagcc 
ctttaaggtc 
tgtattgcag 
agttatcttg 
aagaggagga 
gctgtacgta 
ggtacatatc 
ctgtacctaa 
tgctgataat 
ggatgggcct 
gctgaaactg 
tcctcagata 
ctttatctgc 
tatttaaagg 
gagatactca 
tctgggctat 
gagaggggct 
caccagctga 
tcttcatggt 
aagaatgagg 
gctcagccag 
cagatggttt 
ggaaggaacg 
caattcatct 
gatttagaag 
tgtcataggt 
ggtgcttaac 
ccagcatcca 
tctttcagag 



6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 
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cctgtgggat gtgcagtgac aagaaacgtg gaggggctga gcagcagcac tgcccccagg 8760 

gagcaggagc ggatgccatc ggtggcagca tcccaaatga tgtcagcgga tgctgagcag 8820 

gcagcggacg aacggacaga agcgatgcgt acaccttctg ttgacatggt atttggcagc 8880 

gatttaacac tcgcttccta gtcctgctat tctccacagg ctgcattcaa atgaacgaag 8940 

ggaagggagg caaaaagatg caaaatccga gacaagcagc agaaatattt cttcgctacg 9000 

gaagcgtgcg caaacaacct tctccaacag caccagaaga gcacagcgta acctttttca 9060 

agaccagaaa aggaaattca caaagcctct gtggatacca gcgcgttcag ctctcctgat 9120 

agcagatttc ttgtcaggtt gcgaatgggg tatggtgcca ggaggtgcag ggaccatatg 9180 

atcatataca gcacagcagt cattgtgcat gtattaatat atattgagta gcagtgttac 9240 

tttgccaaag caatagttca gagatgagtc ctgctgcata cctctatctt aaaactaact 9300 

tataaatagt aaaaccttct cagttcagcc acgtgctcct ctctgtcagc accaatggtg 9360 

cttcgcctgc acccagctgc aaggaatcag cccgtgatct cattaacact cagctctgca 9420 

ggataaatta gattgttcca ctctcttttg ttgttaatta cgacggaaca attgttcagt 9480 

gctgatggtc ctaattgtca gctacagaaa acgtctccat gcagttcctt ctgcgccagc 9540 

aaactgtcca ggctatagca ccgtgatgca tgctacctct cactccatcc ttcttctctt 9600 

tcccaccagg gagagctgtg tgttttcact ctcagccact ctgaacaata ccaaactgct 9660 

acgcactgcc tccctcggaa agagaatccc cttgttgctt ttttatttac aggatccttc 9720 

ttaaaaagca gaccatcatt cactgcaaac ccagagcttc atgcctctcc ttccacaacc 9780 

gaaaacagcc ggcttcattt gtctttttta aatgctgttt tccaggtgaa ttttggccag 9840 

cgtgttggct gagatccagg agcacgtgtc agctttctgc tctcattgct cctgttctgc 9900 

attgcctctt tctggggttt ccaagagggg gggagacttt gcgcggggat gagataatgc 9960 

cccttttctt agggtggctg ctgggcagca gagtggctct gggtcactgt ggcaccaatg 10020 

ggaggcacca gtgggggtgt gttttgtgca ggggggaagc attcacagaa tggggctgat 10080 

cctgaagctt gcagtccaag gctttgtctg tgtacccagt gaaatccttc ctctgttaca 10140 

taaagcccag ataggactca gaaatgtagt cattccagcc cccctcttcc tcagatctgg 10200 

agcagcactt gtttgcagcc agtcctcccc aaaatgcaca gacctcgccg agtggaggga 10260 

gatgtaaaca gcgaaggtta attacctcct tgtcaaaaac actttgtggt ccatagatgt 10320 

ttctgtcaat cttacaaaac agaaccgaga ggcagcgagc actgaagagc gtgttcccat 10380 

gctgagttaa tgagacttgg cagctcgctg tgcagagatg atccctgtgc ttcatgggag 10440 

gctgtaacct gtctccccat cgccttcaca ccgcagtgct gtcctggaca cctcaccctc 10500 

cataagctgt aggatgcagc tgcccaggga tcaagagact tttcctaagg ctcttaggac 10560 

tcatctttgc cgctcagtag cgtgcagcaa ttactcatcc caactatact gaatgggttt 10620 
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ctgccagctc tgcttgtttg tcaataagca tttcttcatt ttgcctctaa gtttctctca 10680 

gcagcaccgc tctgggtgac ctgagtggcc acctggaacc cgaggggcac agccaccacc 10740 

tccctgttgc tgctgctcca gggactcatg tgctgctgga tggggggaag catgaagttc 10800 

ctcacccaga cacctgggtt gcaatggctg cagcgtgctc ttcttggtat gcagattgtt 10860 

tccagccatt acttgtagaa atgtgctgtg gaagcccttt gtatctcttt ctgtggccct 10920 

tcagcaaaag ctgtgggaaa gctctgaggc tgctttcttg ggtcgtggag gaattgtatg 10980 

ttccttcttt aacaaaaatt atccttagga gagagcactg tgcaagcatt gtgcacataa 11040 

aacaattcag gttgaaaggg ctctctggag gtttccagcc tgactactgc tcgaagcaag 11100 

gccaggttca aagatggctc aggatgctgt gtgccttcct gattatctgt gccaccaatg 11160 

gaggagattc acagccactc tgcttcccgt gccactcatg gagaggaata ttcccttata 11220 

ttcagataga atgttatcct ttagctcagc cttccctata accccatgag ggagctgcag 11280 

atccccatac tctccccttc tctggggtga aggccgtgtc ccccagcccc ccttcccacc 11340 

ctgtgcccta agcagcccgc tggcctctgc tggatgtgtg cctatatgtc aatgcctgtc 11400 

cttgcagtcc agcctgggac atttaattca tcaccagggt aatgtggaac tgtgtcatct 11460 

tcccctgcag ggtacaaagt tctgcacggg gtcctttcgg ttcaggaaaa ccttcactgg 11520 

tgctacctga atcaagctct atttaataag ttcataagca catggatgtg ttttcctaga 11580 

gatacgtttt aatggtatca gtgattttta tttgctttgt tgcttacttc aaacagtgcc 11640 

tttgggcagg aggtgaggga cgggtctgcc gttggctctg cagtgatttc tccaggcgtg 11700 

tggctcaggt cagatagtgg tcactctgtg gccagaagaa ggacaaagat ggaaattgca 11760 

gattgagtca cgttaagcag gcatcttgga gtgatttgag gcagtttcat gaaagagcta 11820 

cgaccactta ttgttgtttt ccccttttac aacagaagtt ttcatcaaaa taacgtggca 11880 

aagcccagga atgtttggga aaagtgtagt taaatgtttt gtaattcatt tgtcggagtg 11940 

ctaccagcta agaaaaaagt cctacctttg gtatggtagt cctgcagaga atacaacatc 12000 

aatattagtt tggaaaaaaa caccaccacc accagaaact gtaatggaaa atgtaaacca 12060 

agaaattcct tgggtaagag agaaaggatg tcgtatactg gccaagtcct gcccagctgt 12120 

cagcctgctg accctctgca gttcaggacc atgaaacgtg gcactgtaag acgtgtcccc 12180 

tgcctttgct tgcccacaga tctctgccct tgtgctgact cctgcacaca agagcatttc 12240 

cctgtagcca aacagcgatt agccataagc tgcacctgac tttgaggatt aagagtttgc 12300 

aattaagtgg attgcagcag gagatcagtg gcagggttgc agatgaaatc cttttctagg 12360 

ggtagctaag ggctgagcaa cctgtcctac agcacaagcc aaaccagcca agggttttcc 12420 

tgtgctgttc acagaggcag ggccagctgg agctggagga ggttgtgctg ggacccttct 12480 
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ccctgtgctg 


agaatggagt 


gatttctggg 


0?SrTP SFf) 

\J d— -V v_. J- 1 

tgctgttcct 


1 1 <;1" 

1— 1 ^ 1- ■ LA 1. 

gtggcttgca 


ctgagcagct 


12540 


caagggagat 


cggtgctcct 


catgcagtgc 


caaaactcgt 


gtttgatgca 


gaaagatgga 


12600 


tgtgcacctc 


cctcctgcta 


atgcagccgt 


gagcttatga 


aggcaatgag 


ccctcagtgc 


12660 


agcaggagct 


gtagtgcact 


cctgtaggtg 


ctagggaaaa 


tctctggttc 


ccagggatgc 


12720 


attcataagg gcaatatatc 


ttgaggctgc 


gccaaatctt 


tctgaaatat 


tcatgcgtgt 


12780 


tcccttaatt 


tatagaaaca 


aacacagcag 


aataattatt 


ccaatgcctc 


ccctcgaagg 


12840 


aaacccatat 


ttccatgtag 


aaatgtaacc 


tatatacaca 


cagccatgct 


gcatccttca 


12900 


gaacgtgcca gtgctcatct 


cccatggcaa 


aatactacag 


gtattctcac 


tatgttggac 


12960 


ctgtgaaagg 


aaccatggta 


agaaacttcg 


gttaaaggta 


tggctgcaaa 


act act cat a 


13020 


ccaaaacagc agagctccag 


acctcctctt 


aggaaagagc 


cacttggaga 


gggatggtgt 


13080 


gaaggctgga ggtgagagac 


agagcctgtc 


ccagttttcc 


tgtctctatt 


ttctgaaacg 


13140 


tttgcaggag 


gaaaggacaa 


ctgtactttc 


aggcatagct 


ggtgccctca 


cgtaaataag 


13200 


ttccccgaac 


ttctgtgtca 


tttgttctta 


agatgctttg 


gcagaacact 


ttgagtcaat 


13260 


tcgcttaact 


gtgactaggt 


ctgtaaataa 


gtgctccctg 


ctgataaggt 


tcaagtgaca 


13320 


tttttagtgg 


tatttgacag 


catttacctt 


gctttcaagt 


cttctaccaa 


gctcttctat 


13380 


acttaagcag 


tgaaaccgcc 


aagaaaccct 


tccttttatc 


aagctagtgc 


taaataccat 


13440 


taacttcata 


ggttagatac 


ggtgctgcca 


gcttcacctg 


gcagtggttg 


gtcagttctg 


13500 


ctggtgacaa 


agcctccctg 


gcctgtgctt 


ttacctagag 


gtgaatatcc 


aagaatgcag 


13560 


aactgcatgg 


aaagcagagc 


tgcaggcacg 


atggtgctga 


gccttagctg 


cttcctgctg 


13620 


ggagatgtgg 


atgcagagac 


gaatgaagga 


cctgtccctt 


actcccctca 


gcattctgtg 


13680 


ctatttaggg 


ttctaccaga 


gtccttaaga 


ggtttttttt 


-ttttttggtc 


caaaagtctg 


13740 


tttgtttggt 


tttgaccact 


gagagcatgt 


gacacttgtc 


tcaagctatt 


aaccaagtgt 


13800 


ccagccaaaa 


tcaattgcct 


gggagacgca 


gaccattacc 


tggaggtcag 


gacctcaata 


13860 


aatattacca 


gcctcattgt 


gccgctgaca 


gattcagctg 


gctgctccgt 


gttccagtcc 


13920 


aacagttcgg 


acgccacgtt 


tgtatatatt 


tgcaggcagc 


ctcgggggga 


tcatctcagg 


13980 


agcagagcac 


cggcagccgc 


ctgcagagcc 


gggcagtacc 


tcaccatggc 


tttgaccttt 


14040 


gccttactgg 


tggctctcct 


ggtgctgagc 


tgcaagagca 


gctgctctgt 


gggctgcgat 


14100 


ctgcctcaga 


cccacagcct 


gggcagcagg 


aggaccctga 


tgctgctggc 


tcagatgagg 


14160 


agaatcagcc 


tgtttagctg 


cctgaaggat 


aggcacgatt 


ttggctttcc 


tcaagaggag 


14220 


tttggcaacc 


agtttcagaa 


ggctgagacc 


atccctgtgc 


tgcacgagat 


gatccagcag 


14280 


atctttaacc 


tgtttagcac 


caaggatagc 


agcgctgctt 


gggatgagac 


cctgctggat 


14340 


aagttttaca 


ccgagctgta 


ccagcagctg 


aacgatctgg 
Page 


aggcttgcgt 
36 


gatccagggc 


14400 
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gtgggcgtga ccgagacccc tctgatgaag gaggatagca tcctggctgt gaggaagtac 14460 

tttcagagga tcaccctgta cctgaaggag aagaagtaca gcccctgcgc ttgggaagtc 14520 

gtgagggctg agatcatgag gagctttagc ctgagcacca acctgcaaga gagcttgagg 14580 

tctaaggagt aaaaagtcta gagtcggggc ggccggccgc ttcgagcaga catgataaga 14640 

tacattgatg agtttggaca aaccacaact agaatgcagt gaaaaaaatg ctttatttgt 14700 

gaaatttgtg atgctattgc tttatttgta accattataa gctgcaataa acaagttaac 14760 

aacaacaatt gcattcattt tatgtttcag gttcaggggg aggtgtggga ggttttttaa 14820 

agcaagtaaa acctctacaa atgtggtaaa atcgataccg tcgacctcga ctagagcggc 14880 

cactaacata cgctctccat caaaacaaaa cgaaacaaaa caaactagca aaataggctg 14940 

tccccagtgc aagtgcaggt gccagaacat ttctctatcg ataggtaccg agctcttacg 15000 

cgtgctagcc ctcgagcagg atctatacat tgaatcaata ttggcaatta gccatattag 15060 

tcattggtta tatagcataa atcaatattg gctattggcc attgcatacg ttgtatctat 15120 

atcataatat gtacatttat attggctcat gtccaatatg accgccatgt tgacattgat 15180 

tattgactag ttattaatag taatcaatta cggggtcatt agttcatagc ccatatatgg 15240 

agttccgcgt tacataactt acggtaaatg gcccgcctgg ctgaccgccc aacgaccccc 15300 

gcccattgac gtcaataatg acgtatgttc ccatagtaac gccaataggg actttccatt 15360 

gacgtcaatg ggtggagtat ttacggtaaa ctgcccactt ggcagtacat caagtgtatc 15420 

atatgccaag tccgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 15480 

cccagtacat gaccttacgg gactttccta cttggcagta catctacgta ttagtcatcg 15540 

ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 15600 

cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 15660 

atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 15720 

ggcgtgtacg gtgggaggtc tatataagca gagctcgttt agtgaaccgt cagatcgcct 15780 

ggagacgcca tccacgctgt tttgacctcc atagaagaca ccgggaccga tccagcctcc 15840 

cctcgaagct cgactctagg ggctcgagat ctgcgatcta agtaagcttg catgcctgca 15900 

ggtcggccgc cacgaccggt gccgccacca tcccctgacc cacgcccctg acccctcaca 15960 

aggagacgac cttccatgac cgagtacaag cccacggtgc gcctcgccac ccgcgacgac 16020 

gtcccccggg ccgtacgcac cctcgccgcc gcgttcgccg actaccccgc cacgcgccac 16080 

accgtcgacc cggaccgcca catcgagcgg gtcaccgagc tgcaagaact cttcctcacg 16140 

cgcgtcgggc tcgacatcgg caaggtgtgg gtcgcggacg acggcgccgc ggtggcggtc 16200 

tggaccacgc cggagagcgt cgaagcgggg gcggtgttcg ccgagatcgg cccgcgcatg 16260 
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gccgagttga 


gcggttcccg 


gctggccgcg 


025CIP SEQ 
cagcaacaga 


List.txt 
tggaaggcct 


cctggcgccg 


16320 


caccggccca 


aggagcccgc 


gtggttcctg 


gccaccgtcg 


gcgtctcgcc 


cgaccaccag 


16380 


ggcaagggtc 


tgggcagcgc 


cgtcgtgctc 


cccggagtgg 


aggcggccga 


gcgcgccggg 


16440 


gtgcccgcct 


tcctggagac 


ctccgcgccc 


cgcaacctcc 


ccttctacga 


gcggctcggc 


16500 


ttcaccgtca 


ccgccgacgt 


cgaggtgccc 


gaaggaccgc 


gcacctggtg 


catgacccgc 


16560 


aagcccggtg 


cctgacgccc 


gccccacgac 


ccgcagcgcc 


cgaccgaaag 


gagcgcacga 


16620 


ccccatggct 


ccgaccgaag 


ccgacccggg 


cggccccgcc 


gaccccgcac 


ccgcccccga 


16680 


ggcccaccga 


ctctagagtc 


ggggcggccg 


gccgcttcga 


gcagacatga 


taagatacat 


16740 


tgatgagttt 


ggacaaacca 


caactagaat 


gcagtgaaaa 


aaatgcttta 


tttgtgaaat 


16800 


ttgtgatgct 


attgctttat 


ttgtaaccat 


tataagctgc 


aataaacaag 


ttaacaacaa 


16860 


caattgcatt 


cattttatgt 


ttcaggttca 


gggggaggtg 


tgggaggttt 


tttaaagcaa 


16920 


gtaaaacctc 


tacaaatgtg 


gtaaaatcga 


taaggatcaa 


ttcggcttca 


ggtaccgtcg 


16980 


acgatgtagg 


tcacggtctc 


gaagccgcgg 


tgcgggtgcc 


agggcgtgcc 


cttgggctcc 


17040 


ccgggcgcgt 


actccacctc 


acccatctgg 


tccatcatga 


tgaacgggtc 


gaggtggcgg 


17100 


tagttgatcc 


cggcgaacgc 


gcggcgcacc 


gggaagccct 


cgccctcgaa 


accgctgggc 


17160 


gcggtggtca 


cggtgagcac 


gggacgtgcg 


acggcgtcgg 


cgggtgcgga. 


tacgcggggc 


17220 


agcgtcagcg 


ggttctcgac 


ggtcacggcg 


ggcatgtcga 


cagccgaatt 


gatccgtcga 


17280 


ccgatgccct 


tgagagcctt 


caacccagtc 


agctccttcc 


ggtgggcgcg 


gggcatgact 


17340 


atcgtcgccg 


cacttatgac 


tgtcttcttt 


atcatgcaac 


tcgtaggaca 


ggtgccggca 


17400 














17402 


<210> 9 
<211> 5172 

<212> DNA 

<213> Plasmid pRSV-Int 










<400> 9 
ctgcattaat 


gaatcggcca 


acgcgcgggg 


agaggcggtt 


tgcgtattgg 


gcgctcttcc 


60 


gcttcctcgc tcactgactc gctgcgctcg 


gtcgttcggc 


tgcggcgagc 


ggtatcagct 


120 


cactcaaagg 


cggtaatacg 


gttatccaca 


gaatcagggg 


ataacgcagg 


aaagaacatg 


180 


tgagcaaaag 


gccagcaaaa 


ggccaggaac 


cgtaaaaagg 


ccgcgttgct 


ggcgtttttc 


240 


cataggctcc 


gcccccctga 


cgagcatcac 


aaaaatcgac 


gctcaagtca 


gaggtggcga 


300 


aacccgacag 


gactataaag 


ataccaggcg 


tttccccctg 


gaagctccct 


cgtgcgctct 


360 


cctgttccga 


ccctgccgct 


taccggatac 


ctgtccgcct 


ttctcccttc 


gggaagcgtg 


420 


gcgctttctc 


aatgctcacg 


ctgtaggtat 


ctcagttcgg 


tgtaggtcgt 


tcgctccaag 


480 
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ctgggctgtg 


tgcacgaacc 


ccccgttcag 


025CIP SEQ 
cccgaccgct 


List.txt 
gcgccttatc 


cggtaactat 


540 


cgtcttgagt 


ccaacccggt 


aagacacgac 


ttatcgccac 


tggcagcagc 


cactggtaac 


600 


aggattagca 


gagcgaggta 


tgtaggcggt 


gctacagagt 


tcttgaagtg 


gtggcctaac 


660 


tacggctaca 


ctagaaggac 


agtatttggt 


atctgcgctc 


tgctgaagcc 


agttaccttc 


720 


ggaaaaagag 


ttggtagctc 


ttgatccggc 


aaacaaacca 


ccgctggtag 


cggtggtttt 


780 


tttgtttgca 


agcagcagat 


tacgcgcaga 


aaaaaaggat 


ctcaagaaga 


tcctttgatc 


840 


ttttctacgg 


ggtctgacgc 


tcagtggaac 


gaaaactcac 


gttaagggat 


tttggtcatg 


900 


agattatcaa 


aaaggatctt 


cacctagatc 


cttttaaatt 


aaaaatgaag 


ttttaaatca 


960 


atctaaagta 


tatatgagta 


aacttggtct 


gacagttacc 


aatgcttaat 


cagtgaggca 


1020 


cctatctcag 


cgatctgtct 


atttcgttca 


tccatagttg 


cctgactccc 


cgtcgtgtag 


1080 


ataactacga 


tacgggaggg 


cttaccatct 


ggccccagtg 


ctgcaatgat 


accgcgagac 


1140 


ccacgctcac 


cggctccaga 


tttatcagca 


ataaaccagc 


cagccggaag 


ggccgagcgc 


1200 


agaagtggtc 


ctgcaacttt 


atccgcctcc 


atccagtcta 


ttaattgttg 


ccgggaagct 


1260 


agagtaagta 


gttcgccagt 


taatagtttg 


cgcaacgttg 


ttgccattgc 


tacaggcatc 


1320 


gtggtgtcac 


gctcgtcgtt 


tggtatggct 


tcattcagct 


ccggttccca 


acgatcaagg 


1380 


cgagttacat 


gatcccccat 


gttgtgcaaa 


aaagcggtta 


gctccttcgg 


tcctccgatc 


1440 


gttgtcagaa 


gtaagttggc 


cgcagtgtta 


tcactcatgg 


ttatggcagc 


actgcataat 


1500 


tctcttactg 


tcatgccatc 


cgtaagatgc 


ttttctgtga 


ctggtgagta 


ctcaaccaag 


1560 


tcattctgag 


aatagtgtat 


gcggcgaccg 


agttgctctt 


gcccggcgtc 


aatacgggat 


1620 


aataccgcgc 


cacatagcag 


aactttaaaa 


gtgctcatca 


ttggaaaacg 


ttcttcgggg 


1680 


cgaaaactct 


caaggatctt 


accgctgttg 


agatccagtt 


cgatgtaacc 


cactcgtgca 


1740 


cccaactgat 


cttcagcatc 


ttttactttc 


accagcgttt 


ctgggtgagc 


aaaaacagga 


1800 


aggcaaaatg 


ccgcaaaaaa 


gggaataagg 


gcgacacgga 


aatgttgaat 


actcatactc 


1860 


ttcctttttc 


aatattattg 


aagcatttat 


cagggttatt 


gtctcatgag 


cggatacata 


1920 


tttgaatgta 


tttagaaaaa 


taaacaaata 


ggggttccgc 


gcacatttcc 


ccgaaaagtg 


1980 


ccacctgacg 


tcgacggatc 


gggagatctc 


ccgatcccct 


atggtcgact 


ctcagtacaa 


2040 


tctgctctga 


tgccgcatag 


ttaagccagt 


atctgctccc 


tgcttgtgtg 


ttggaggtcg 


2100 


ctgagtagtg 


cgcgagcaaa 


atttaagcta 


caacaaggca 


aggcttgacc 


gacaattgca 


2160 


u y ci ct y dd L u T. 


get. ragggri. 


aggcguT.x.i.g 


cgcx.gci.xcg 


cgaiigxacgg 


gccagaTaiia 




cgcgtgctag 


gggtctagga 


tcgattctag 


gaattctcta 


gccgcggtct 


agggatcccg 


2280 


gcgcgtatgg 


tgcactctca 


gtacaatctg 


ctctgatgcc 


gcatagttaa 


gccagtatct 


2340 


gctccctgct 


tgtgtgttgg 


aggtcgctga 


gtagtgcgcg 
Page 


agcaaaattt 
39 


aagctacaac 


2400 
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aaggcaaggc 

gcttcgcgat 

aaaagcgggg 

ttgcataggg 

acgatgagtt 

gaagtaaggt 

ggacgaacca 

aataaacgcc 

ggtaccggtc 

acaaggggtt 

gcgcgagaat 

ggccgacctt 

cagcgaagcg 

cctgaacgaa 

ctcgcgcctg 

gacgattgtt 

cctgattatg 

cgacacgaag 

cttcgagctt 

catcaacaag 

cgtaatccgg 

cagtcaagcc 

cgccgtgccg 

ggcaaccgtt 

ctacaagaag 

cgacccgatc 

gtggtatgag 

gcaagccatt 

gaagcgcggg 

cccgtccgca 

gttcgttgcg 



ttgaccgaca 

gtacgggcca 

cttcggttgt 

agggggaaat 

agcaacatgc 

ggtacgatcg 

ctgaattccg 

atttgaccat 

cggaattccc 

gtgaccgggg 

tcgagcgcag 

cagcgcgaag 

ccgggcacgt 

tgccgcgccg 

aaggtcatgg 

tccactcagg 

cggctcgacg 

aaccttcagc 

gtttcggaga 

cttgcgcact 

tggtggtggc 

gccattcacc 

acccggggcg 

atgcgaatcc 

aagccggacg 

acgctccggc 

cttcaggcgt 

ctgtccgcca 

gaagaatcga 

cctgggcagc 

gaacgcatct 
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attgcatgaa gaatctgctt agggttaggc 
gatatacgcg tatctgaggg gactagggtg 
acgcggttag gagtcccctc aggatatagt 
gtagtcttat gcaatacact tgtagtcttg 
cttacaagga gagaaaaagc accgtgcatg 
tgccttatta ggaaggcaac agacaggtct 
cattgcagag ataattgtat ttaagtgcct 
tcaccacatt ggtgtgcacc tccaagcttg 
gggtcgacga gctcactagt cgtagggtcg 
tggacacgta cgcgggtgct tacgaccgtc 
caagcccagc gacacagcgt agcgccaacg 
tcgagcgcga cgggggccgg ttcaggttcg 
cggcgttcgg gacggcggag cgcccggagt 
ggcggctcaa catgatcatt gtctatgacg 
acgcgattcc gattgtctcg gaattgctcg 
aaggcgtctt ccggcaggga aacgtcatgg 
cgtcgcacaa agaatcttcg ctgaagtcgg 
gcgaattggg cgggtacgtc ggcgggaagg 
cgaaggagat cacgcgcaac ggccgaatgg 
cgaccactcc ccttaccgga cccttcgagt 
gtgagatcaa gacgcacaaa caccttccct 
cgggcagcat cacggggctt tgtaagcgca 
agacgattgg gaagaagacc gcttcaagcg 
ttcgggaccc gcgtattgcg ggcttcgccg 
gcacgccgac cacgaagatt gagggttacc 
cggtcgagct tgattgcgga ccgatcatcg 
ggttggacgg cagggggcgc ggcaaggggc 
tggacaagct gtactgcgag tgtggcgccg 
tcaaggactc ttaccgctgc cgtcgccgga 
acgaaggcac gtgcaacgtc agcatggcgg 
tcaacaagat caggcacgcc gaaggcgacg 
Page 40 



gttttgcgct 

tgtttaggcg 

agtttcgctt 

caacatggta 

ccgattggtg 

gacatggatt 

agctcgatac 

catgcctgca 

ccgacatgac 

agtcgcgcga 

aagacaaggc 

tcgggcattt 

tcgaacgcat 

tgtcgcgctt 

ccctgggcgt 

acctgattca 

cgaagattct 

cgccttacgg 

tcaatgtcgt 

tcgagcccga 

tcaagccggg 

tggacgctga 

cctgggaccc 

ctgaggtgat 

gcattcagcg 

agcccgctga 

tttcccgggg 

tcatgacttc 

aggtggtcga 

cactcgacaa 

aagagacgtt 



2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 
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ggcgcttctg 


tgggaagccg 


1 

cccgacgctt 


cggcaagctc 


actgaggcgc 


ctgagaagag 


4320 


cggcgaacgg 


gcgaaccttg 


ttgcggagcg 


cgccgacgcc 


ctgaacgccc 


ttgaagagct 


4380 


gtacgaagac 


cgcgcggcag 


gcgcgtacga 


cggacccgtt 


ggcaggaagc 


acttccggaa 


4440 


gcaacaggca 


gcgctgacgc 


tccggcagca 


aggggcggaa 


gagcggcttg 


ccgaacttga 


4500 


agccgccgaa 


gccccgaagc 


ttccccttga 


ccaatggttc 


cccgaagacg 


ccgacgctga 


4560 


cccgaccggc 


cctaagtcgt 


ggtgggggcg 


cgcgtcagta gacgacaagc gcgtgttcgt 


4620 


cgggctcttc 


gtagacaaga 


tcgttgtcac 


gaagtcgact 


acgggcaggg 


ggcagggaac 


4680 


gcccatcgag 


aagcgcgctt 


cgatcacgtg 


ggcgaagccg 


ccgaccgacg 


acgacgaaga 


4740 


cgacgcccag 


gacggcacgg 


aagacgtagc 


ggcgtagcga gacacccgga tccctcgagg 


4800 


ggccctattc 


tatagtgtca 


cctaaatgct 


agagctcgct 


gatcagcctc 


gactgtgcct 


4860 


tctagttgcc 


agccatctgt 


tgtttgcccc 


tcccccgtgc 


cttccttgac 


cctggaaggt 


4920 


gccactccca 


ctgtcctttc 


ctaataaaat 


gaggaaattg 


catcgcattg 


tctgagtagg 


4980 


tgtcattcta 


ttctgggggg 


tggggtgggg 


caggacagca 


agggggagga 


ttgggaagac 


5040 


aatagcaggc 


atgctgggga 


tgcggtgggc 


tctatggctt 


ctgaggcgga 


aagaaccagg 


5100 


tgcccagtca 


tagccgaata 


gcctctccac 


ccaagcggcc 


ggagaacctg 


cgtgcaatcc 


5160 


actgggggcg 


eg 










5172 


<210> 10 
<211> 6233 

<212> DNA 

<213> Plasmid pCR-XL- 


-TOPO-CMV-pur-attB 








<400> 10 
agcgcccaat 


acgcaaaccg 


cctctccccg 


cgcgttggcc 


gattcattaa 


tgcagctggc 


60 


acgacaggtt 


tcccgactgg 


aaagcgggca 


gtgagcgcaa 


cgcaattaat 


gtgagttagc 


120 


tcactcatta 


ggcaccccag 


gctttacact 


ttatgcttcc 


ggctcgtatg 


ttgtgtggaa 


180 


ttgtgagcgg 


ataacaattt 


cacacaggaa 


acagctatga 


ccatgattac 


gccaagctat 


240 


ttaggtgacg 


cgttagaata 


ctcaagctat 


gcatcaagct 


tggtaccgag 


ctcggatcca 


300 


ctagtaacgg 


ccgccagtgt 


gctggaattc 


gcccttggcc gcaataaaat 


atctttattt 


360 


tcattacatc 


tgtgtgttgg 


ttttttgtgl: 


gaatcgatag tactaacata 


cgctctccat 


420 


caaaacaaaa 


cgaaacaaaa 


caaactagca 


aaataggctg 


tccccagtgc 


aagtgcaggt 


480 


gccagaacat 


ttctctatcg 


ataggtaccg 


agctcttacg 


cgtgctagcc 


ctcgagcagg 


540 


atctatacat 


tgaatcaata 


ttggcaatta 


gccatattag 


tcattggtta 


tatagcataa 


600 


atcaatattg 


gctattggcc 


attgcatacg 


ttgtatctat 


atcataatat 


gtacatttat 


660 


attggctcat 


gtccaatatg 


accgccatgt 


tgacattgat 


tattgactag ttattaatag 


720 
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taatcaatta 


cggggtcatt 


agttcatagc 


025C1P SEQ 
ccatatatgg 


Li St . txt 
agttccgcgt 


tacataactt 


780 


acggtaaatg 


gcccgcctgg 


ctgaccgccc 


aacgaccccc 


gcccattgac 


gtcaataatg 


840 


acgtatgttc 


ccatagtaac 


gccaataggg 


actttccatt 


gacgtcaatg 


ggtggagtat 


900 


ttacggtaaa 


ctgcccactt 


ggcagtacat 


caagtgtatc 


atatgccaag 


tccgccccct 


960 


attgacgtca 


atgacggtaa 


atggcccgcc 


tggcattatg 


cccagtacat 


gaccttacgg 


1020 


gactttccta 


cttggcagta 


catctacgta 


ttagtcatcg 


ctattaccat 


ggtgatgcgg 


1080 


ttttggcagt 


acatcaatgg 


gcgtggatag 


cggtttgact 


cacggggatt 


tccaagtctc 


1140 


caccccattg 


acgtcaatgg 


gagtttgttt 


tggcaccaaa 


atcaacggga 


ctttccaaaa 


1200 


tgtcgtaaca 


actccgcccc 


attgacgcaa 


atgggcggta 


ggcgtgtacg 


gtgggaggtc 


1260 


tatataagca 


gagctcgttt 


agtgaaccgt 


cagatcgcct 


ggagacgcca 


tccacgctgt 


1320 


tttgacctcc 


atagaagaca 


ccgggaccga 


tccagcctcc 


cctcgaagct 


cgactctagg 


1380 


ggctcgagat 


ctgcgatcta 


agtaagcttg 


catgcctgca 


ggtcggccgc 


cacgaccggt 


1440 


gccgccacca 


tcccctgacc 


cacgcccctg 


acccctcaca 


aggagacgac 


cttccatgac 


1500 


cgagtacaag 


cccacggtgc 


gcctcgccac 


ccgcgacgac 


gtcccccggg 


ccgtacgcac 


1560 


cctcgccgcc 


gcgttcgccg 


actaccccgc 


cacgcgccac 


accgtcgacc 


cggaccgcca 


1620 


catcgagcgg 


gtcaccgagc 


tgcaagaact 


cttcctcacg 


cgcgtcgggc 


tcgacatcgg 


1680 


caaggtgtgg 


gtcgcggacg 


acggcgccgc 


ggtggcggtc 


tggaccacgc 


cggagagcgt 


1740 


cgaagcgggg 


gcggtgttcg 


ccgagatcgg 


cccgcgcatg 


gccgagttga 


gcggttcccg 


1800 


gctggccgcg 


cagcaacaga 


tggaaggcct 


cctggcgccg 


caccggccca 


aggagcccgc 


1860 


gtggttcctg 


gccaccgtcg 


gcgtctcgcc 


cgaccaccag 


ggcaagggtc 


tgggcagcgc 


1920 


cgtcgtgctc 


cccggagtgg 


aggcggccga 


gcgcgccggg 


gtgcccgcct 


tcctggagac 


1980 


ctccgcgccc 


cgcaacctcc 


ccttctacga 


gcggctcggc 


ttcaccgtca 


ccgccgacgt 


2040 


cgaggtgccc 


gaaggaccgc 


gcacctggtg 


catgacccgc 


aagcccggtg 


cctgacgccc 


2100 


gccccacgac 


ccgcagcgcc 


cgaccgaaag 


gagcgcacga 


ccccatggct 


ccgaccgaag 


2160 


ccgacccggg 


cggccccgcc 


gaccccgcac 


ccgcccccga 


ggcccaccga 


ctctagagtc 


2220 


ggggcggccg 


gccgcttcga 


gcagacatga 


taagatacat 


tgatgagttt 


ggacaaacca 


2280 


caactagaat 


gcagtgaaaa 


aaatgcttta 


tttgtgaaat 


ttgtgatgct 


attgctttat 


2340 


ttgtaaccat 


tataagctgc 


aataaacaag 


ttaacaacaa 


caattgcatt 


cattttatgt 


2400 


XTZcaygLtca 


gggggaggtg 


rgggaggrtt: 


"tttaaagcaa 


gtaaaacctc 


tacaaatgtg 


2460 


gtaaaatcga 


taaggatcaa 


ttcggcttca 


ggtaccgtcg 


acgatgtagg 


tcacggtctc 


2520 


gaagccgcgg 


tgcgggtgcc 


agggcgtgcc 


cttgggctcc 


ccgggcgcgt 


actccacctc 


2580 


acccatctgg 


tccatcatga 


tgaacgggtc 


gaggtggcgg 
Page 


tagttgatcc 
42 


cggcgaacgc 


2640 
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gcggcgcacc 


gggaagccct 


cgccctcgaa 


accqctqqqc 


qcqqtqqtca 


cqqtqaacac 


2700 


gggacgtgcg 


acggcgtcgg 


cqqqtqcqqa 


tacqcaqqgc 


aqcqtcaqcq 


ggttctcgac 


2760 


ggtcacggcg 


qqcatqtcaa 


cagccgaatt 


ga°tccg°tcga 


ccgaligccct 


v^y viy V* 


2820 


caacccag'tc 


agctccttcc 






a'tcQ'tcacca 






tgtcttcttt 


atLcatgcaac 


tcgtaggaca 




acactcttcc 


ac'tt'cct'rac 




tcactgactc 


actacQctca 






□□'tatcaact 


f"af iT'a a a nn 

\. v» c^cLciy y 




cggtaatacg 


gt:tat:ccaca 


aaatcaaaaa 


aliaacacaaa 


aaaaaacalia 


aannnf"n;iat* 
^■^yyy^y w<* i- 




tctgcagata 


tccatcacac 


t:aacaaccac 


tcgagcatgc 


atctaaaaaa 


VaV-\_.Gl%l L la, VjVJ^j 


3120 


cctatagtga 


gtcgtattac 


aaHcac'tgg 


ccatcatttt 




oa piTinrmaa 
y**^ Lyyy acici 


^ J-Ovl 


accctggcgt 


tacccaactt 


aatcacctta 

W ^3 ^* *• 2J 


caacaca'tcc 


ccctttcacr 


ciy ^ ^yy^y 




atagcgaaga 


ggcccgcacc 


OlSX-CQCCCXX. 

%^%At W Vf %■ 


cccaacaatt 


y^lJ^Giy VaVa laCl 






agtttaaggt 


ttacacctat 


aaaagagaga 


□ccQ't'ta'tca 


^Va lay la L lal^ 


n a n 1" a a n ^ 
yci. Ly (.ACciyci 




gtgatattat 


t:aacacacca 


aaacaacaaa 

^y^^yyo- 


tioa'taa'tccc 


cctoaccaat 


y ^cii-y u«» Ly ^ 




tgtcagataa 


agtctcccgt 


gaactt'tacc 


caataa'Caca 


'tatcaaaaa't 


y»*****y v-yy^ 


3480 


gcatgatgac 


caccgatatg 


accaqtatac 


cggtctccgi: 


talicaqaaaa 


aa a n t" o n r t" o 
yt**-*-y *-y y ^ *-y 


3540 


atctcagcca 


ccgcgaaaat 


gacatcaaaa 


acgccaftaa 


cctaatattc 


'taaaaaa 1'at* 
*-yy y y«w- i.« *- 




aaatgtcagg 


catgagatt.a 


"tcaaaaagga 


tcttcaccta 


aatccttt'tc 

V* Vi^>^ V Va w 


o,\^\^ uci^ciciciy 


3f)f^n 


ccagtccgca 


gaaacgqtqc 


tgaccccgga 


tgaatgtcag 




Q. l_ l_ ^ d V. CICa. 


^770 


gggaaaacgc 


aagcgcaaag 


agaaagcagg 


taacttacaa 




•-yy ^y«- Lay 




taqactqqqc 


qqttttatqq 


acaqcaaaca 


aaccaaaatl: 




y L-y i_L-L. LL. Ly 


JO*tL/ 


qtaaqqttqq 


qaaqccctqc 


aaagtaaact 


qqataacttt 


^ ^y X— y »^ Vaa d 


ay y ci. uv. Ly a. l 




qqcqcagqqq 


atcaagctct 


ga'tcaagaga 


caggaxgagg 


atcatttcar 

V* ^ Var V. v. ^ V- Vaa. 






aagatggatt 


qcacqcaqqt 


tctccqqccq 


cttqqatqaa 

^ ^ ^9 9 9 99^ 


aaaactatitic 




4070 


gggcacaaca 


gacaatcggc 


tgctctgatg 


ccgccgtgtt 


ccaoctatca 


yL.yL.ayyyyL- 


4080 


gcccggttct 


ttttgtcaag 


accgacctgt 


ccggtgccct 


aaataaacta 

^* V^ ^*J*A VC V^ V. VJ 


caaaacaaaa 


4140 


cagcgcggct 


atcgtggctg 


qccacaacaa 


acattcctta 




ctcaacal'ta 


4200 


tcactgaagc 


gggaagggac 


tggctgctat 


taaqcqaaat 


y w^yyyy vauy 


aatctcctat 


4260 


Co. L^clL.C L 


J 1 1 ' f ^ 

Lgc kcc Lgcc 


gagaaagrax 


ccarcaTggc 


tgatgcaatg 


cggcggctgc 


4320 


atacgcttga 


tccggctacc 


tgcccattcg 


accaccaagc 


gaaacatcgc 


atcgagcgag 


4380 


cacgtactcg 


gatggaagcc 


ggtcttgtcg 


atcaggatga 


tctggacgaa 


gagcatcagg 


4440 


ggctcgcgcc 


agccgaactg 


ttcgccaggc 


tcaaggcgag 


catgcccgac 


ggcgaggatc 


4500 








page 
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tcgtcgtgac ccatggcgat gcctgcttgc cgaatatcat ggtggaaaat ggccgctttt 4560 

ctggattcat cgactgtggc cggctgggtg tggcggaccg ctatcaggac atagcgttgg 4620 

ctacccgtga tattgctgaa gagcttggcg gcgaatgggc tgaccgcttc ctcgtgcttt 4680 

acggtatcgc cgctcccgat tcgcagcgca tcgccttcta tcgccttctt gacgagttct 4740 

tctgaattat taacgcttac aatttcctga tgcggtattt tctccttacg catctgtgcg 4800 

gtatttcaca ccgcatacag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg 4860 

tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat 4920 

gcttcaataa tagcacgtga ggagggccac catggccaag ttgaccagtg ccgttccggt 4980 

gctcaccgcg cgcgacgtcg ccggagcggt cgagttctgg accgaccggc tcgggttctc 5040 

ccgggacttc gtggaggacg acttcgccgg tgtggtccgg gacgacgtga ccctgttcat 5100 

cagcgcggtc caggaccagg tggtgccgga caacaccctg gcctgggtgt gggtgcgcgg 5160 

cctggacgag ctgtacgccg agtggtcgga ggtcgtgtcc acgaacttcc gggacgcctc 5220 

cgggccggcc atgaccgaga tcggcgagca gccgtggggg cgggagttcg ccctgcgcga 5280 

cccggccggc aactgcgtgc acttcgtggc cgaggagcag gactgacacg tgctaaaact 5340 

tcatttttaa tttaaaagga tctaggtgaa gatccttttt gataatctca tgaccaaaat 5400 

cccttaacgt gagttttcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc 5460 

ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct 5520 

accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg 5580 

cttcagcaga gcgcagatac caaatactgt ccttctagtg tagccgtagt taggccacca 5640 

cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc 5700 

tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga 5760 

taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac 5820 

gacctacacc gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga 5880 

agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag 5940 

ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg 6000 

acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag 6060 

caacgcggcc tttttacggt tcctgggctt ttgctggcct tttgctcaca tgttctttcc 6120 

tgcgttatcc cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc 6180 

tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aag 6233 

<210> 11 
<211> 234 
<212> DNA 
<213> artificial 
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<220> 

<223> attp containing polynucleotide 
<400> 11 

gactagtact gacggacaca ccgaagcccc ggcggcaacc ctcagcggat gccccggggc 
ttcacgtttt cccaggtcag aagcggtttt cgggagtagt gccccaactg gggtaacctt 
tgagttctct cagttggggg cgtagggtcg ccgacatgac acaaggggtt gtgaccgggg 
tggacacgta cgcgggtgct tacgaccgtc agtcgcgcga gcgcgactag taca 



<210> 12 

<211> 26 

<212> DNA 

<213> artificial 

<220> 

<223> Primer attB-for 

<400> 12 

taccgtcgac gatgtaggtc acggtc 26 
<400> 13 

cys Gly Gly Pro Lys Lys Lys Arg Lys val Gly 
1 5 10 



<210> 13 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> LysOSl 



60 
120 
180 
234 
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